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Preface 



Cells are the elemental living units of all biological systems. They can form 
unicellular organisms. And they can form complex multicellular organisms through 
the interaction of a huge number of individual cells. Understanding the biology of 
the cell is the key to understanding life in its simplest and its most highly developed 
form. Biological research has already provided a high degree of understanding of 
many cellular processes. The detailed knowledge of these processes is the result of 
the astonishing development of experimental techniques in the last few decades. 
The accumulation of this knowledge has led to a change in the focus of research 
towards trying to understand the interaction between cellular processes. This has 
revealed the existence of highly interdependent networks within cells that seem 
impossible to comprehend using traditional biological approaches. 

As a response to this wealth of data and complexity of life, current biological 
research increasingly relies on mathematical methods in formulating results and 
analyzing experiments. The formulation of ideas in terms of mathematical models 
allows for an integrated perception and can give structure to complex data. The 
understanding of the dynamic interaction of components is typically only possi- 
ble in mathematical terms. Furthermore, mathematics helps to identify general 
principles and the predictions of mathematical models may serve as a guideline 
for further research. The importance of the contributions of mathematics and 
theoretical physics to biology is evidenced by the rise of new disciplines such as 
Computational Cell Biology and Systems Biology, and by the launch of new pro- 
grams to fund the interaction of experimental and mathematical analyses. This 
book documents part of this development in the field of cell biology. It focuses 
on structural, dynamical and functional aspects of cellular systems and presents 
corresponding experiments and mathematical models. 

We hope that this highly-interdisciplinary book may serve as an introduction 
for biologists, mathematicians and physicists to key questions in cellular systems 
that can be studied with mathematical models. Recent model approaches are pre- 
sented with applications in cellular metabolism, intra- and intercellular signaling, 
cellular mechanics, network dynamics and pattern formation. In addition, applica- 
tions of these approaches to tumor biology, the immune system and biotechnology 
are included. 

This book is based on selected peer-reviewed contributions and discussions 
at the “First International MTBio Workshop on the Function and Regulation of 
Cellular Systems: Experiments and Models” (Dresden, June 24-30, 2001). The in- 
ternational competence and information network MTBio (Modeling and Theory in 
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the Biosciences, www.mtbio.de) has been founded recently to improve communi- 
cation between experimentalists and theoreticians who are interested in applying 
mathematical models to biological problems. 

This book is only possible due to the excellent intellectual environment in 
the Center for High Performance Computing (ZHR, Dresden University of Tech- 
nology) provided by its head Professor Wolfgang Nagel, and in particular to the 
tremendous support and help of numerous colleagues from the ZHR, especially 
Thomas Kriebitzsch, Claudia Schmidt and Felix Sinne. The authors also thank 
the Max Planck Institute for the Physics of Complex Systems (MPI-PKS), Dres- 
den, for its hosting the Workshop and providing ongoing support for the MTBio 
network. Particularly, Katrin Lantsch from the MPI-PKS guest programme has 
helped tremendously in the workshop organisation and book preparation. In addi- 
tion to the authors, we would like to thank several other colleagues who contributed 
to the quality of the book by reviewing chapter submissions. 

A. Deutsch, M. Falcke, J. Howard, W. Zimmermann 

Berlin, Dresden, Saarbriicken, February 2003 
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Introduction to Part I 

Reinhart Heinrich, Thomas Hdfer 

Institut fiir Biologie, Humboldt-Universitat zu Berlin, Germany 

Modelling of metabolic networks is one of the traditional fields of mathe- 
matical biology. Initially, modelling concerned mainly glycolysis and also now, 30 
years later, this pathway is one of the favoured subjects for theoretical investiga- 
tion. However, mathematical modelling has been successfully performed also for 
other metabolic networks in a multitude of cell types such as the pentose phos- 
phate pathway, the citric acid cycle, or photosynthesis and amino acid synthe- 
sis. Moreover, the interaction between metabolism and membrane transport and, 
more recently, interrelations with signal transduction and gene expression have 
been studied. Mathematical modelling of metabolic networks became also a tool 
for clinical studies by quantifying the effects of enzyme deficiencies or by elucidat- 
ing the interactions of parasites with their host cells. Furthermore, modelling is 
already now helpful in the field of metabolic engineering by identifying targets of 
genetic manipulation for increasing the yield of biosynthetic products. 

The main problems which have been addressed by mathematical modelling 
are the following: (1) Simulation of the dynamic behaviour of a whole pathway; 

(2) Identification of enzymatic steps which exert strong control on metabolism; 

(3) Elucidation of the role of nonlinear regulatory couplings; (4) Understanding 
the stoichiometric and kinetic design of pathways on the basis of evolutionary 
optimization; (5) Identification of metabolic subsystems which may act in a quasi- 
independent manner as metabolic pathways in producing functionally important 
end products. 

A prerequisite for the development of simulation models is the knowledge 
of the kinetic properties of the individual enzymes in terms of rate equations as 
well as the existence of data for the system dynamics under stationary or time- 
dependent conditions. For a number of cells sufficient experimental information 
for that purpose has been accumulated. The best studied systems are the energy 
metabolisms of erythrocytes, of yeast and of E. coli. For these cells, mathematical 
modelling became also a guide to pinpoint enzymatic reaction steps for which 
additional data are required or to propose critical experiments whose outcomes 
may be compared with the theoretical predictions. 

Research in metabolic systems has gained from the development of a concept 
called metabolic control analysis (MCA). The extent to which the various steps 
of a metabolic pathway affect metabolic fluxes or the concentrations of metabolic 
intermediates are quantified in a rigorous way by control coefficients of enzymes 
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and by elasticity coefficients for their reaction rates. This analysis paved the way 
to understand the behaviour of whole systems by the properties of the individual 
steps and by the stoichiometric characteristics of the networks. Several rules for 
distributed control have been discovered which led to the formulation of general 
summation and connectivity theorems. 

Among the nonlinearities contributing to the complexity of metabolic net- 
works, feedback inhibitions and other regulatory loops are of utmost importance. 
Whereas some of these non-stoichiometric couplings act in favour of a homeostasis 
of metabolic function, others are responsible for complicated dynamic phenomena 
such as oscillations, bistability or chaos. In fact, metabolic oscillations are widely 
observed, and their theoretical and experimental investigations gave many insights 
into the biochemical mechanisms of biorhythms in general. 

In contrast to chemical reaction systems of inanimate nature enzyme cat- 
alyzed reaction systems are the outcome of biological evolution and in this way 
of natural selection. Accordingly, it is generally assumed that metabolic pathways 
show some fitness properties. Viewing evolution as an optimization process and 
investigating metabolism on the basis of extremum principles turned out to be 
essential for the understanding of the design of biochemical reaction networks. 
Mathematical analysis of this interesting subject is still at the very beginning but 
several aspects have been studied, such as the optimal distributions of enzyme con- 
centrations along pathways, optimal values for Michaelis-Menten constants, and 
optimal stoichiometric arrangements of reactions. Future studies in these direc- 
tions will help to understand the robustness of metabolic pathways and of living 
matter in general. 

For many cell types mathematical modelling of metabolic dynamics seems 
to be premature since the rate equations of their enzymes and numerical values 
of kinetic parameters are still unknown. However, a special kind of mathematical 
analysis proved to be useful which is solely based on the stoichiometric properties 
of the networks which are often better known. Stoichiometric relations give rise 
to constraints for flux distributions which are, for example, of relevance for the 
maximization of the metabolic yield in the held of biotechnology. A thorough 
stoichiometric analysis has led to the concept of elementary networks, that is, of 
pathways acting with a minimum number of steps for the conversion of initial 
substrates into end products. In this way, mathematical analysis was helpful to 
clarify the concept of metabolic pathways which are usually defined in a more 
intuitive way. 

The present chapter collects four papers covering various topics of metabolic 
modelling mentioned above. Hyime, Dan0 and S0rensen present a detailed simu- 
lation model of yeast glyolysis with special emphasis on metabolic dynamics in an 
open flow reactor. This includes steady state analysis, investigation of glyolytic os- 
cillations and thorough parameter estimations. Cellular oscillations are also stud- 
ied in the paper of Ruoff. His special focus is temperature-compensation which 
is essential for oscillations when acting as biological clocks, e.g., the biochemical 
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mechanisms underlying circadian rhythms. In particular, he derives necessary con- 
ditions for the occurrence of temperature-compensation. Jeneson, WesterhoflF and 
Kushmerick apply metabolic control analysis to the ATP free energy metabolism 
of skeletal muscle. Special attention is paid to rate control exerted by ATPases 
and the effects of calcium ions. They also make use of an extended version of 
MCA, called regulation analysis. Forster, Akesson and Nielsen give an overview 
on various kinds of steady state analyses such as metabolic flux analysis. 
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A Functional Dynamics Approach to Modelling 
of Glycolysis 

Finn Hynne, Sune Dan0 and Preben G. S0rensen 

University of Copenhagen, Department of Chemistry, H.C. 0rsted Institute, 

Universitet spar ken 5, 2100 Copenhagen, Denmark 



1. Introduction 

The access to ever increasing computer power as well as the availability of huge 
amounts of raw data from high-throughput experimental techniques in genomics 
and proteomics have renewed interest in the modelling of biosystems. The dream 
is in silicio biology where computer models have become so sophisticated that 
answers to biological relevant questions can equally well but with much less 
effort be given by the computer as by the cell. 

A number of current research projects have very ambitious goals. Some in- 
tend to model huge and complex biosystems quantitatively, while others aim at 
reconstructing the reaction network of an entire cell from gene expression data. 
We, however, have a more humble goal in the research presented here. In an effort 
to take biochemical modelling as far as possible in terms of realism, we have lim- 
ited ourselves to the very well defined system of glycolysis in yeast cells showing 
metabolic oscillations [1]. For this particular experimental system there is a host 
of data avaliable describing both enzymatic, metabolic and dynamic properties. 
The aim of our modelling is to achieve quantitative agreement with all available 
experimental findings for this particular system. 

If the ultimate goal is in silicio biology for an entire cell, then why is it 
interesting to model a small fraction of yeast metabolism? The answer hats three 
parts. 

One is that glycolysis in a yeast cell is one of the simplest mechanisms by 
which a cell can remain alive, so that the faithfulness which can be reached mod- 
elling this system sets an upper limit of what can be hoped for by complete cell 
modelling in more complex situations. 

The modelling of this system using conventional methods is still a formi- 
dable task. It is of primary importance that we have developed a new method 
for the optimization of biochemical models which reduces the size of the parame- 
ter fitting problem by exploiting constraints imposed by stoichiometry and mass 
fiow balances. Also the computational efficacy is increased by including system 
dynamics in the fitting. We expect that the method presented here is relevant in 
a much broader context than yeast glycolysis. Still, the method cannot handle 
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the entire cell if realistic modelling in terms of biochemical reactions is intended. 
The paramount problem is that the size of the parameter space that needs to be 
searched in order to find a parameter set which agrees with experiments, grows 
exponentially with the number of unknown parameters. As a consequence, even 
the fastest computers will never be able to solve much larger systems. Therefore, 
the availability of expression profiles for the around 6000 genes in Saccharomyces 
cerevisiae does not imply that one has ample data to model its genetic regulation. 
It merely refiects the fact that the entire yeast cell is an extremely complicated 
entity. 

From a more philosophical point of view, one could argue that there is a 
context hierarchy in biology: function (dynamics), energy and mass flow (metabo- 
lism), reactions (enzymes), and templates (genes). This implies that metabolism is 
closer to the biological function than the genome, and that even though the genes 
are important, they do not constitute a blueprint for cellular function. Therefore, 
it might be more fruitful to model metabolism than genomics, especially because 
the complexity of the metabolome is much lower than that of the genome (600 
metabolites vs. 6000 genes in Saccharomyces cerevisiae), but also because the 
problem becomes more tractable by exploiting physical and chemical constraints 
on metabolism. 

The work described here involves a new fitting procedure, new ways of com- 
bining mechanistic and dynamical data, as well as a lot of biochemical and ex- 
perimental considerations and comparisons. With the space avaliable, we cannot 
possibly account for the details of the rigorous mathematics underlying the fitting 
procedure, nor can we give a detailed account of the use of experimental data. For 
such information, the reader should consult [2] which is an in-depth description of 
the present work. 



2. The Biological System 

The system which we intend to model is yeast cells in an open flow reactor (a 
CSTR) with constant inflow of glucose, cyanide and cell suspension and outflow 
of surplus liquid. This setup allows us to control the state of the cells precisely 
by controlling the rates of the various inflows, and the particular operating point 
which we model corresponds to the onset of sustained oscillations as the inflow of 
glucose is increased. By a combination of such manipulations and the use of spe- 
cial perturbation experiments we have characterized the onset of oscillations at the 
operating point as the emergence of almost simultaneous supercritical Hopf bifur- 
cations in each of the yeast cells [2,3]. The significance of this observation is that 
the yeast cell dynamics follow a simple equation which describes the behaviour 
of systems close to this type of bifurcation. The close correspondence between a 
solution to this equation the Stuart-Landau equation and the cell behavior is 
illustrated by Fig. 1, where the Stuart-Landau equation is fitted to two pertur- 
bation experiments, where either glucose or acetaldehyde is added in preciesely 
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the unique amount and unique phase that causes a momentary quenching of the 
oscillations. This characterization of the dynamics is a big advantage for the pa- 
rameter fitting, since it provides us with a tractable mathematical framework for 
the comparison between the dynamics of the model and the dynamics of the yeast 
cells. 




216 219 222 225 




Figure 1. Fit of a Stuart-Landau equation (black curves) to two 
different quenching experiments (grey curves), where the instan- 
taneous addition of glucose (a) and acetaldehyde (b) causes a mo- 
mentary quenching of the oscillations. Experimental data from [3], 
see [4] for details of the fitting. 

Furthermore, the yea-st cells are known to be strongly coupled [5 7], so we can 
assume that the behaviour of a single typical yeast cell can be inferred from that 
of the entire population. Obviously, this is a big advantage during the parameter 
fitting. 

The experimental design helps to reduce the complexity of the biological sys- 
tem considerably. The cells in the experiments are starved beforehand, so they 
only have a limited amount of amino acids avaliable for protein synthesis. This 
ensures that the cells are non-growing, and that the enzyme composition of the 
yeast cells does not change significantly in the rather short time course of the ex- 
periments. Furthermore, the presence of cyanide effectively blocks the respiratory 
part of the metabolism, so we are left with glycolysis and fermentation plus a 
branch for glycerol production and a branch for glycogen buildup. This reaction 
network is shown in Fig. 2. 

Apart from working with a well defined system, it is of utmost importance to 
have as much experimental data avaliable as possible. As described in Ref. [2], one 
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Figure 2. The reaction network of the model. Reactions 
2,13,16, and 18 are membrane transport reactions and reactions 
1,14,17,19, and 21 are in/out flows of the CSTR (For colored pic- 
ture see color plate 1). 



can combine the CSTR experiments with experiments performed on the oscillating 
transients observed when first a glucose pulse and then a cyanide pulse is added 
to a suspension of yeast cells (transient experiments; [7 9]). The combined data 
set has data on the period of oscillations, on 13 metabolite concentrations, on 
the glycolytic flux and on its branching ratios. We also have measurements of 
nine phases and 13 amplitudes of metabolites during the oscillations, data on 
the mechanical operating conditions of the CSTR and the precise location of the 
Hopf bifurcation, as well as the period of oscillations and data from the quenching 
experiments. We also have in vitro data on 24 enzyme kinetic parameters. All of 
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these data were included in the parameter fitting of the model as described in the 
following section. 



3. Development of the Model 

The first thing to consider when developing a model is the type of equations. It 
is by far most convenient to work with a set of ordinary differential equations, 
but that might not always give an adequate description of the system. There are 
two issues to consider: Is the number of molecules big enough, and is the system 
spatially homogeneous? 

In metabolism one is generally in the milimolar range of concentrations, so 
within a typical yeast cell of say 6 fim diameter one will find N ^ 10^ for each of 
the metabolites. Stochastic fluctuations are on the order of \/]V 3 • 10'^ which is 

less than Tyoo of N. However, one might run into trouble when modelling other 
cellular systems, notably the regulatory network of the genome. 

Intracellular spatial homogeneity is ensured by diffusion if the volume of the 
cell is small enough. That is, the characteristic time for diffusive mixing in the spa- 
tial compartment should be less than the characteristic time of the dynamics in 
question. Again, for a typical yeast cell we have ^„iix ~ < 0.1s when the diffu- 

sion constant D is taken to a typical value for metabolites in water, 5- 10“^'cm^/s. 
This seems small when compared to the 35 40s period of the observed oscilla- 
tions, Nevertheless, travelling NADH waves have been observed in neutrophiles 
which are elongated cells with a typical length of 20 /im [10], so in general one 
cannot rule out the possibility of spatial heterogeneity. Still, the wavelength of the 
NADH waves observed in that study is so large that they would not fit in a yeast 
cell. Consequently, we will assume that the yeast cells are spatially homogeneous. 

We will assume that the enzyme activity is the same for all experiments 
used in the parameter fitting. Note that this only makes sense if the experimental 
values are obtained for identical cells which are truly in the same proteomic state 
and that this state remains constant during the experiments. Closer inspection 
shows that the cells are not completely identical but have for example a size 
distribution. If the distribution in properties is narrow the dynamics is almost 
identical with the dynamics for a system with identical cells. A discussion of this 
synchronization problem can be found in [4] . The time constancy of cell properties 
is only approximately valid in the transient experiments, but is confirmed for the 
CSTR through the remarkable long-time stability of the oscillatory amplitude [3]. 

Since our goal is a quantitatively and biochemically realistic description of the 
yeast cell experiments, we formulate our model so that direct comparisons between 
experimental findings and model predictions can be made. The intention is to make 
the model as simple as possible, but nevertheless the need for comparison with 
experiments makes it rather detailed with 24 reactions (of which 12 are reversible), 
22 metabolites and 59 parameters. (Details of the model can be inferred from Fig. 2 
and Table 1 or found in [2].) Fig. 3 depicts the system described by the model. 
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Figure 3. Schematic figure of a yeast cell in a CSTR. In the real 
experiments the reactor contained 1 billion cells. The dynamics 
of a one-cell reactor is identical to the dynamics of a many-cell 
reactor provided the state af the cells are identical, and the fiow 
rate and volume is increased in proportion with the number of 
cells. 

In the model we assume that the functional forms of the rate expressions are 
known for each enzyme for example from in vitro studies. What remains to be 
chosen is the values of the kinetic parameters in the rate expressions. The kinetic 
parameters is a collection of rate constants, maximum velocities, Michaelis-Menten 
parameters and other parameters. Most of these constants have been determined 
in in vitro experiments on isolated enzymes. We have, however, no guarantee that 
the in vitro values are applicable when the enzymes are embedded in a living cell. 
The maximum velocities for the enzymes in the living cell depend on the enzyme 
activities and these values are largely unknown. 

In principle, we want to test all possible parameter combinations and pick 
the one which gives the best match with experiments. However, this is not possible 
since the number of possible parameter combinations is astronomical. So, we need 
to decrease the size of the parameter fitting task. The basic observation, which 
allows us to do this, is the fact that the behaviour of a chemical reaction system is 
determined by the stoichiometry of its reactions and the kinetics of each of these. 
The connection between stoichiometry and concentrations is given by 

dCt, ^ 

= 2_^ ^svVr or in matrix notation c = iz • v (1) 

V 

where c.s is the concentration of the metabolite s, Vr is the velocity of reaction r, 
and Usr is the stoichiometric coefficient of metabolite s in reaction r. This equation 
simply states that the total change in the concentration of a given metabolite 
is the sum of the changes caused by the individual reactions, and that each of 
these contributions is given by the velocity of that reaction multiplied by the 
stoichiometric coefficient for that reaction. In matrix notation the time derivatives 
of the concentrations are collected in a vector c, the reaction velocities are collected 
in a vector v, and the stoichiometric coefficients are collected in a matrix u. Our 
first trick is to work only with the stationary states of the reaction system. Since 
we want to model the yeast cell system at the point where the stationary state 
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Table 1. Selected rate equations of the model. The remaining 
reactions are modelled either by standard Michaelis-Menten like 
rate equations or by mass action kinetics as indicated by the 
colour coding in Fig. 2 (For colored picture see color plate 2). 



becomes unstable and oscillations emerge, this is a fully valid description of the 
experimental system. For stationary states we have c = 0, so these states can be 
found from Eq. (1) as the null space of u which is the space of all vectors v for 
which 1/ • V = 0. Geometrically, the relevant part of velocity space is a polygonal 
cone (colloquially: a pyramid of infinite height) extending from origo, and it is 
generally of smaller dimension than the velocity space. The cone is known as the 
current cone, and the velocity vectors defining the edges of the cone are known as 
the extreme currents. These concepts are illustrated in Fig. 4. 
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Figure 4. Sketch of a two-dimensional current cone in a three- 
dimensional velocity space. v\^ and are the velocities of 
three reactions, and Ei and E 2 are the extreme currents defin- 
ing the edges of the current cone which is shaded grey. These 
edges appear because all velocities are non negative. In this fic- 
titious three- reaction system, stationary states are only found on 
the two-dimensional current cone. 



Chemical kinetics must also be included in the description in order to make 
it complete. The rate of a chemical reaction r can generally be written as 

n, = V,^,(c,K,) (2) 

where is the “velocity parameter” of the reaction, c is a vector containing 
the concentrations of the metabolites of the reaction system, and K,. is a vec- 
tor containing the “intrinsic parameters” of the reaction (z.e. all the remain- 
ing parameters after the velocity parameter has been factored out). Consider, 
for example, the rate equation of reaction 5 (Tab. 1 ). Here Vr, = Vr,m, ^5 = 

[FGP]'^/ (^1 + /^D ^ F [F 6 P]^^ , and K 5 is the set of /Cr, and /^r,. The split- 

ting of the parameters into velocity parameters and intrinsic parameters makes it 
straight forward to take advantage of the possibility to calculate stationary veloc- 
ities from the stoichiometry of the reaction system. We pick a set of stationary 
velocities v, a set of intrinsic parameters K^. for each of the reactions, and any 
set of concentrations c which we want to become stationary concentrations of the 
model. From these we can now calculate the intrinsic parameters of all the reac- 
tions as Wr — '^r/^r(c, K^-). This way we fit one parameter per reaction by invoking 
the stationarity condition, and we also obtain a model which by construction 
has a stationary state with the experimentally determined concentrations. This is 
achieved solely through algebraic operations, which are extremely efficient when 
compared to the usual fitting procedures involving numeric integration of the ki- 
netic equations or repeated Newton iterations of the rate expressions describing 
the reaction system. For biochemical systems, the direct calculation of the velocity 
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parameters is very convenient, since these parameters are inherently difficult to 
measure. 

When considering only stationary states, one might think that the dynamical 
properties of the system cannot be included in the parameter fitting. This is, 
however, not the case. The dynamics near a stationary state can be described 
quite accurately by a linearization of the kinetics around this state. Furthermore, 
by choosing the operating point for the parameter fitting as the point where the 
stationary state loses stability and the oscillations emerge, we have in a sense a 
stationary state and an oscillatory state at the same time. So, in the course of 
the parameter fitting we calculate the frequency of the oscillations, the relative 
amplitudes of the different metabolites, their relative phases, and the parameters 
characterizing the quenching responses show in Fig. 1 from the linearized kinetics 
of the stationary state. These system properties are then included in the model 
validation. Again, the calculations involved are algebraic operations, so they do not 
consume excessive amounts of computer power as in the classical methods based 
on integration of the kinetic equations. (Technically, the dynamical properties are 
calculated from the eigenvalues and the left and right eigenvectors of the Jacobian 
matrix.) 

With the procedure described above, we can evaluate the agreement with 
experiments for a given set of parameters in an extremely efficient way. Therefore, 
we can easily scan several million parameter combinations. Furthermore, we have 
a rather large collection of experimental data on the system as described at the 
end of Section 2. The data on the size of the glycolytic flux and its branching 
ratios together with the operating conditions of the CSTR are used to pick re- 
alistic stationary velocities, which are then plugged into Eq. (2) along with the 
known metabolite concentrations and the known enzyme kinetic parameters. These 
values are then supplemented with values for the unknown enzyme kinetic parame- 
ters and values for the metabolite concentrations which have not been determined 
experimentally. The data describing the relative phases and amplitudes of the 
metabolites during the oscillations, the frequency of oscillations, the special per- 
turbation experiments of Fig. 1 as well as the type and direction of the bifurcation 
are used in the model validation. Even with the procedure presented here, it is 
impossible to do a full scan of all relevant parameter combinations. But it is possi- 
ble to try out a large number of parameter combinations, so with suitable human 
guidance it was possible to arrive at a model which is biochemically realistic and 
in quantitative agreement with almost all experimental observations. 



4. Results of the Optimization 

The optimized parameter set was selected after an evaluation of more than 100 
million parameter combinations. An in depth discussion of the achieved results 
can be found in [2]. Only the main points are summarized here. The values of 
the parameters for the operating conditions of the reactor such as the glycolytic 
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Cs /mM 


ds/dNADH 


6s /deg 


Qs / Q AC 


0s /deg 


Glc, 


1.55 (1.6) 


0.013 


135 


5.3 (11) 


355 (4) 
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0.57 


1.83 


12 


18 


81 
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4.2 (4.1) 


15.8 (21) 


190 (260) 


1.7 


67 
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0.49 (0.5) 
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178 (250) 


1.7 
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4.4 


218 
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0.295 (0.04) 


30 


7.0 


255 


DHAP 


2.95 (2.5) 


6.97 (0.5) 


38 


7.9 


195 
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0.0003 (n.d) 


0.002 


136 


0.53 


287 


PEP 


0.04 (0.04) 


0.023 (0.07) 


18 


1.1 


286 


Pyr 


8.7 (8.7) 


4.06 (7) 


79 


125 


180 


ACA 


1.48 


0.894 


196 


2.5 


268 


EtOHx 


16.5 


0.035 


114 


oo (n.p) 


undef 


Eton 


19.2 


1.22 


26 


oo 


undef 


Glyc 


4.2 


1.68 


98 


(X) 


undef 


Glyc, 


1.68 


0.005 


188 


(X) (n.p) 


undef 


ACA^ 


1.29 


0.037 (0.3) 


284 (200) 


1(1) 


181 (172) 


GN- 


5.2 


5xl0"'’ 


193 


2400 (n.p) 


271 


ATP 


2.1 (2.1) 


10.8 (8) 


139 (180) 


0.50 


289 


ADP 


1.5 (1.5) 


6.32 (9.4) 


319 (0) 


1.0 


290 


AMP 


0.33 (0.33) 


4.5 (3.6) 


319 (0) 






NADH 


0.33 (0.33) 


1 (1) 


0(0) 


0.68 


106 


NAD+ 


0.65 (0.65) 


1 (0.6) 


180 (180) 







Table 2. Comparison of model results with experimental results 
related to metabolites: concentrations c.s of metabolites s, ampli- 
tudes of oscillations a.s in units of the NADH amplitude, phases 
of oscillations 6s relative to the phase of NADH, quenching con- 
centrations Qs in units of the AC Ax quenching concentration, and 
quenching phases (f)s relative to the phase of NADH. Experimental 
results are quoted in parenthesis. The origin of the experimental 
results are described in [2]. n.d.: not detectable, n.p.: quenching 
was attempted but was not possible. This corresponds to a large 
value of <7.s/<7ac:a,- 



flux were in complete agreement with the actual operating conditions. We also ob- 
tained exact agreement with experimental estimates for the branching coefficients 
for fermentation, glycerol production, lactonitril formation and glycogen buildup. 
Exact agreement between the model and the experiments was also obtained for 
the location and type of the bifurcation, just as the period of oscillations was fixed 
to the measured value of 37.5s. Table 2 shows a comparison of model values with 
experimental values for metabolite concentrations c^, oscillation amplitudes a^, 
oscillation phases Og, quenching concentrations and quenching phases 0s. 
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All the stationary concentrations agrees quite well with the experimental 
determinations. The same is true for most of the amplitudes of the oscillations. 
Noteworthy exceptions are the amplitude of GAP and DHAP. The fit of the phases 
are less satisfactory. Note the good agreement for the quenching amplitude ratio 
and quenching phases of Glcx and AC Ax. The quenching additions are made to the 
extracellular medium. Together with the fast cellular response to an instantaneous 
perturbation shown in Fig.l they demonstrate that the transport of Glc and AC A 
through the membrane is fast compared with the period of oscillation. Perhaps 
the most interesting result from the parameter fitting is that we have obtained a 
determination of the activities for the glycolytic enzymes inside a living cell based 
on experimental measurements. More tables with comparisons are given in Ref. [2]. 
The data in one of these tables demonstrates that the values for the Michaelis- 
Menten constants for the selected parameter point agrees remakably well with 
literature values. The model also compares well with experimental behavior at 
other operating points. 



5. Discussion and Perspectives 

We have presented a powerful, general method, based on functional dynamics, 
for fitting of kinetic parameters of a model for an entire pathway in a living cell 
to experimental data. All experiments are made at a fixed stationary state un- 
der experimental conditions where it can be assumed that the enzyme activities 
are constant. Rate constants and maximum velocities are determined by simple 
algebra without numeric integration of the kinetic equations. The model agrees 
with almost all experimental observations and data for Saccharomyces cerevisiae 
at an operating point where metabolic oscillations have just emerged. Beisically, 
the method can be used for other systems at a stationary state where substantial 
experimental material for stationary and dynamic properties is available. The use 
of measured dynamical properties for small amplitude deviations from the station- 
ary state is a new and important addition to the efficacy of the fitting procedure. 
Fundamentally the method does not depend on the existence of small amplitude 
oscillations. By using the method on stationary states of different environmental 
conditions and for mutant organisms quantitative information can be obtained 
about the metabolome which is at the heart of biological function. 
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1. Introduction 

In order to function as physiological clocks, circadian (and certain ultradian) 
rhythms contain homeostatic mechanisms which compensate for external tem- 
perature variations and other environmental influences. In this paper a theory for 
temperature-compensation and general homeostasis for physiological clocks is pre- 
sented and compared with experimental findings. Results obtained from different 
organisms indicate that circadian pacemakers are based on one or several negative 
feedback loops where protein products of clock genes act as inhibitors of their own 
transcription. We have simulated the occurrence of temperature-compensation by 
using a simple reaction-kinetic model (the so-called Goodwin oscillator) which 
mimicks the negative feedback loop of a circadian pacemaker. The comparison 
between simulation calculations and experiments of Neurospora and Drosophila 
clock mutants shows that both period length and temperature-compensation ap- 
pear closely connected through the stability /degradation rate of clock proteins. 

2. Compensation Mechanisms in Biological Clocks 

Circadian clocks [13] play important roles in the adaptation of organisms to 
the daily changes in their environments. They are found in eukaryotic single-cell 
and multicellular organisms and even in certain prokaryotes. A central task of 
biological clocks involve the timing of physiological events both on a daily and 
seasonal basis. Examples for daily adaptations include the regulation of sleep or 
activity periods in humans and higher organisms, conidiation in fungi, fruit fly 
eclosion or leaf movements in plants. Studies on honey bees show that spatial 
orientation to the sun is closely related to the ability of the insects to determine 
the time of day [4] . Seasonal timing events that are directed by circadian rhythms 
and daylength are reproduction, hibernation or migration of organisms as well as 
flower induction. As described by Van’t Hoff’s rule, temperature has a profound 
effect on most chemical and biochemical processes with Qio values of about 2 or 
higher [5] . Although biochemical (enzyme-catalyzed) reactions are quite dependent 
on temperature [6], the period lengths of circadian rhythms remain practically 
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unaffected by changing the environmental temperature. As a result, Qio values of 
circadian rhythms are generally close to 1. This indicates that circadian rhythms 
function as true physiological clocks, and therefore must contain compensation 
mechanisms against fluctuations of temperature or other environmental influences 
[7,8]. 

Although little is known about how these compensation mechanisms work, 
it is possible to make some general statements when temperature-compensat- 
ion (or period-homeostasis) occurs. Here I will present a theory for tempera- 
ture-compensation, analyse the implications of models for properties of circadian 
rhythmicity and show that both the circadian period length and temperature- 
compensation appear to be closely related through the stability of certain clock 
proteins. Some of these theoretical predictions have experimentally been confirmed 
for the model organisms Neurospora crassa and Drosophila. 



2.1. Modelling Complex Reaction Networks 

It is now generally believed that all chemical change, no matter how complex, is 
the consequence of a number of elementary processes [10]. An elementary process 
takes place in a single step (it cannot be broken down into smaller steps) and 
involves at most three reactant and three product molecules. The kinetic orders 
of an elementary process are identical with the stoichiometric coefficients, and the 
net direction of an elementary process is determined from the free energies of for- 
mation and the concentrations of the participiating species. These constraints on 
kinetic order and reaction direction make the consideration of elementary reac- 
tions a powerful tool in the modelling of complex reaction systems: the temporal 
behavior of any reacting system can be described by a simultaneous set of nonlin- 
ear differential equations, where the number of equations is equal to the number 
of elementary processes. If all rate constants are known, computer programs [12] 
can perform numerical simulations of the development for any system of inter- 
est, where the computations can be compared with experimental observations. 
Although the constraints of elementary reactions do in principle not apply to 
component stoichiometric processes (for example taking the process of transcrip- 
tion), we still can try to approximate them by elementary reactions and refer to 
them as pseudo- elementary processes. The usefulness in looking at elementary or 
pseudo-elementary reactions in the description of complex reaction systems has 
clearly been demonstrated in the study of oscillating reactions [11]. Oscillating 
reactions are probably the most complicated (nonliving) chemical processes that 
have been elucidated in detail. Although a living cell is considerably more complex 
than any of the chemical oscillators considered so far, the principles outlined above 
together with the use of high-speed computers provide the language by which the 
dynamics of various cell processes can (quantitatively) be described. 
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2.2. General Criteria for Period-Homeostasis 

2.2.1. Temperatme-compensation To understand how temperature-compensa- 
tion can be achieved in circadian oscillators (or any other reaction kinetic os- 
cillator) I consider all (elementary or pseudo-elementary) processes Ri that may 
occur in a single cell. Index i identifies the process Ri and its rate constant ki. N is 
the total number of processes. The infiuence of temperature T on Ri is described 
by the Arrhenius equation 



ki = A^e ( 1 ) 

where Ai is the (here assumed temperature-independent) pre-exponential factor, 
Ei is the activation energy, and R is the gass constant. The period length P of the 
rhythm will be a complicated function / of the k'-s: 

P = /(/Ci,/C2,/C3, . . . ,/Cyv) (2) 

By applying the chain rule and expressing dkifdT by means of Eq. (1), the con- 
dition for temperature-compensation can be written as 




Each process i will either positively or negatively (or not) contribute to the over- 
all dP/dT. Thus, by rearranging the processes into period-increasing or period- 
decreasing contributions (when T is increased), we can describe temperature- 
compensation as a balance between these two opposing sets of reactions: 



E 



i,P — increasing 



( X E. = 

\^lnk^ J 



E 

j , P — decreasing 



( dlnf \ 
\dlnkj ) 



X E, 



( 4 ) 



Thus, temperature-compensation is expected to occur whenever the E^-weighted 
sensitivity coefficients din f / dink j of period increasing processes balance with the 
contributions of the period-decreasing processes. In fact, 
for a given reaction network an infinite (!) number of 
Ei-combinations is possible that will satisfy Eq. (4) and 
lead to temperature-compensation. The experimental task 
therefore is to find those (during evolution developed) 
reactions together with their E^ -combinations which are 
responsible for temperature-compensation of a certain 
rhythm. 

The above concept of opposing reactions has a nice 
analogy in the construction of a temperature-compensated 
mechanical pendulum. In the mechanical compensation 
pendulum a variety of approaches exist which differ in the 
type of arrangement to keep the bob of the pendulum at approaximately the 
same length [15]. In the Invar-pendulum (figure to the left) the increase of the 
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pendulum- rod is compensated by a sleeve that expands in opposite direction to 
the expansion of the rod. Hastings and Sweeney [14] were the first to explain 
temperature-compensation in circadian rhythms by the concept of opposing reac- 
tions. However, while the Hastings and Sweeney approach puts the control mech- 
anism outside of the oscillator, there is evidence that the molecular mechansims of 
temperature-compensation ( “the opposing mechanism” ) are an integral part of the 
oscillator. According to Franck [16] both positive and negative (i.e., “opposing”) 
elements (“feedbacks”) are necessary to build a chemical (or physiological) oscil- 
lator. Thus, any reaction kinetic oscillator has, in principle, all elements available 
to become temperature-compensated [19]. 

2.2.2. General homeostasis of the circadian period It is now well established 
that temperature-compensation is only one aspect of a more general homeostatic 
mechanism, which compensates circadian rhythms against various environmental 
influences such as pH, food supply, salinity, etc [7]. Theoretically, simultaneous 
homeostasis conditions may be formulated for each influencing physico-chemical 
property i.e.. 




that lead to an overall compensation of the circadian period [21]. 

3. Models of Circadian Rhythmicity 

3.1. Basic Mechanisms 

During recent years considerable progress has been made to unravel the mecha- 
nisms of circadian rhythmicity [2]. The use of clock mutants has made it possible 
to study the influence of point mutations on period length, phase resetting, as 
well as temperature- or pH-compensation. A considerable amount of evidence sug- 
gests that negative feedback loops play important roles not only in the generation 
of circadian rhythmicity, but also for the compensation mechanisms of circadian 
oscillators. Fig. lA shows the basic mechanism of the circadian clock in Neu- 
rospora crassa together with the putative light input pathpays. The Neurospora 
clock rhythm is normally studied in growth tubes or on petri-plates by means of 
a sporulation assay where the period is determined as the time interval between 
rhythmic sporulation peaks (Fig. IB), [9,17]. The frequency {frq) gene plays an 
important role as it may be considered as the pacemaker of the rhythm: after 
transcription of frq, frq-mRNA is transported into the cytosol and there trans- 
lated into the FRQ-protein. FRQ is transported back into the nucleus where it 
inhibits its own transcription. Thus, transcription, translation and inhibition of 
frq generates a negative feedback loop. A correct timing between active transcrip- 
tion (the positive element) and inhibition of transcription (the negative element) is 
necessary to generate oscillations [16]. Phosphorylation of FRQ-protein influences 
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FRQ-stability, and, as we will see below, is an important factor in determining pe- 
riod length, temperature-compensation and resetting behavior of the rhythm. The 
white-collar proteins WC-1 and WC-2 form a heterodimer (white-collar complex, 
WCC) which serves as a transcription factor in the generation of frq-mRNA. Con- 
stitutively elevated expression of frq results in the loss of rhythmicity (because the 
timed interplay between positive and negative elements causing the oscillations is 
disturbed), while step changes in the level of frq-mRNA resets the phase of the 
clock [13]. Light and temperature, two very important environmental Zeitgeber 
signals, reset the Neurospora clock by changing the levels of frq-mRNA and FRQ- 
protein [9]. By phosphorylating the WCC, light increases the transcription rate of 
frq while Z (the transcription inhibition factor which is probably a phosphorylated 
form of FRQ, Fig. lA) is no longer able to inhibit frq transcription. Temperature 
changes, on the other hand, are expected to influence all processes dependent on 
their activation energies as described in the Arrhenius-equation (1). 



3.2. The Goodwin Model 



Although the reaction scheme in Fig. lA is clearly a simplifled caricature of the 
biochemistry of the Neurospora clock, I think it is of importance to consider mech- 
anisms in form of minimal models. Minimal models contain a minimum amount 
of information, and their study can provide valuable predictions or suggestions 
for further experimental work. Interestingly, more than 30 years ago Brian Good- 
win [18] proposed the possibility of physiological oscillations for the same three- 
dimensional negative feedback loop as described in Fig. 1 A. In the Goodwin model, 
the time-dependence of the variables X {frq-mRNA), Y (FRQ-protein) and Z (tran- 
scription inhibition factor) is described by a set of three coupled differentiel equa- 
tions 



dX 

dt 

dt 

dZ 

dt 



I l'z> - 


(6) 


k2X - kr,Y 


(7) 


k^Y - k(iZ 


(8) 



where each rate constant k{ is related to reaction Ri (Fig. lA). The transcriptional 
inhibition term 1/(1 -h Z^) is often criticized due to its large and unrealistic coop- 
erativity exponent (Hill-constant). This criticism appears unjustifled, because the 
large cooperativity is the result of the small number of intermediates: as soon as 
the number of intermediates in the feedback loop are increased (or the reaction 
order in the kinetic equations is increased [25]) the cooperativity decreases. 

The oscillations generated by the Goodwin equations can be viewed as re- 
peated relaxations to “high” and “low” steady states dependent on whether tran- 
scription process R\ (Fig. lA) is either “on” or “off”. The relaxation time r, which 
describes how fast these steady states are approached determines the period. In- 
termediates E {E = X, Y, orZ) are synthesized and degraded by the rates rg and 
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Figure 1. (A) Basic mechanism of the circadian clock in Neu- 
rospora crassa (for details see text, replotted from [24]. (B) Rhyth- 
mic bands of conidial spores are generally used as an assay for 
the circadian clock in N. crassa. The picture shows the rhythmic 
sporulation observed in two growth tubes for a strain with an 
average period of approximately 21 h. 

kd'E.), and the relaxation time r to the (low or high) steady states, Ess, 
is only dependent on the degradation rate constant kd (with r = k^^ and So = 
S-concentration at t=0): 

^ S ^ =Vs - kdE => E = Ess -h (So - Ess)e~^^^^ (9) 

Although the simple structure of the Goodwin equations may overempha- 
size the importance of the degradation rate constants in the feedback loop, the 
S feature of the model makes the prediction that a decreased degradation 
of any intermediate should result in an increased r and period length: as kd (i.e., 
/c 4 , /c 5 , or /c(i) decreases, r and the period length will increase. Because the long 
and short period frq clock mutants differ by point mutations (a single amino acid 
exchange) in the FRQ-protein [27], we proposed [26] that the frq clock mutants 
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Figure 2. Experimental data [28] showing wild- type frq~^~ 
mRNA and FRQ"^ -protein levels, as well as levels for frq^-mRNA 
and FRQ^-protein (left). By using the determined value of /c 4 , the 
analytical solution of the FRQ-protein (Y) concentration from the 
Goodwin model (no transcription occurs) can be fitted to the ex- 
perimental data by using /c 2 and kr, as adjustable parameters. 



differ in the stabilities of their respective FRQ-proteins. For example, the long 
period length in the frq^-mutant is the result of a more stable FRQ-protein. By 
use of experimental data from Garceau et al. [28] showing how frq-mRNA and 
FRQ-protein cycles in both wild- type and the /rq^-mutemt, it became possible to 
estimate the FRQrdegradation rate constant R 5 (Fig. 2). To do this, we assumed 
that during a first-order frq-mRNA decrease the transcription reaction Ri is off 
(^ki = 0). First, the degradation rate constant /C 4 for frq-mRNA (X) was deter- 
mined; then the analytical solution of the FRQ-protein concentration was fitted to 
the experimental data by using k ‘2 and /C 5 as adjustable parameters [29]. The fits 
and the derived FRQ half-lifes are show in Fig. 2 and Table 1, respectively. The 
table clearly shows that the FRQ^-protein is more stable than wild- type FRQ. 

Recently, Liu et al. [30] confirmed more directly the relationship between 
clock-protein stability and circadian period as predicted by the Goodwin model. 
In one experiment the phosphorylation of FRQ-protein was blocked by 6 -DMAP 
( 6 -dimethylaminopurine), and a decrease in the FRQ-protein degradation rate was 
observed together with a corresponding lengthening of Neurospora’s clock period. 
In another set of elegant experiments certain phosphorylation sites in FRQ were 
blocked by site-directed mutagenesis. Especially the replacement of Ser-513 by 
isoleucine or aspartic acid resulted in a significantly slower FRQ degradation rate 
and a very long circadian period length [30]. Interestingly, for Drosophila similar 
observations were made in the study of the clock gene double-time (dbt) [31]. 
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Figure 3. (A) Experimental period lengths as a function of tem- 
perature in wild-type and in the short period mutant frq^ and in 
the long period mutant frq^. Note the loss of temperature compen- 
sation in the frq^ mutant. (B) Calculated temperature behavior 
for the same system as in A by use of the Goodwin model. 



Double-time expresses a mammalian casein kinase le homolog [32], which regulates 
the clock (PER) protein phosphorylation and its stability (turnover), which in 
Drosophila plays a similar role as FRQ does in Neurospora. It has been found that 
long period and short period mutants of dbt, dbt^^ and dbt^ , respectively, appear 
to be related to more stable or less stable PER proteins. 

3.3. Modelling Temperature-Compensation 

Fig. 3A shows the temperature dependence of period length of Neurospora crassa 
wild- type {frq^) together with the long and short period mutants frq^ and frq^, re- 
spectively. Both wild- type {frq'^) and frq^ have temperature compensated periods, 
while the rhythm of the long period mutant has lost temperature-compensation. 
The period of frq^ decreases with increasing temperature, very much like what is 
observed in chemical oscillating reactions [33]. This behavior can be understood 
by determining the din f j dink j sensitivities of the Goodwin model, i.e., determine 
the contributions for the period increasing the reactions R\, R 2 and R 3 and the 
period decreasing reactions R 4 , Rd, Rg according to Eq. (4) [20]: 



0.0023(F;i + E 2 -f E■^) = 0.3583(F;4 + F 5 ) -h 0.3416F;(i (10) 



Table 1. frq'^-uRNA and FRQ half-lifes [29] 



Species 


Determined Half-Live, h 


frq^ (wild-type)-mK^ A 


3.39 ± 0.42 


frq^ -mKNA 


2.86 ±0.31 


FRQ“^ (wild-type) 


2.9 


FRQ7 


6.2 
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The relatively large numbers in front of the activation energies E 4 - Eq 
(Eq. (10)) indicate the high sensitivity of the period length to the degradation 
rate constants k 4 - as already mentioned (Eq. (9)). The loss of temperature- 
compensation by a change in FRQ-protein leading to a long-period mutant can 
be explained as the result of a point-mutation that leads to a more stable FRQ- 
protein whose rate of degradation (i.e., k^) is reduced. This can be accomplished 
by an increase in activation E^ and a decrease in the pre-exponential factor 
Eq. (1). Both will lead to a decrease in k^, but only an increase in E^ will result 
in an increased temperature-dependence of the period. The larger the increase in 
E^ gets, the larger becomes the temperature-dependence of the rhythm. In this 
way the loss of temperature-compensation in frq^ can be explained due to a more 
stable FRQ-protein (Fig. 3B). The presence of temperature-compensation in the 
short period frq^ mutant indicates a less stable FRQ-protein (due to a increased 
^ 5 -factor), but with the temperature balance (Eq. (10)) still intact. 



3.4. Ockham’s Razor: An Outlook 

We have seen that the relatively simple Goodwin model can describe temperature- 
compensation in the Neurospora clock. Apart from this, the Goodwin model is also 
able to model experimental observations of phase resetting by pulses of tempera- 
ture, heat-shock, light, or cycloheximide [20,22,24]. Unfortunately, space limita- 
tions do not allow to describe these experiments and model calculations in further 
detail. Today, computers make it possible to integrate hundreds of elementary re- 
action with the possibility to model highly complex reaction networks. So why 
bother with minimal models? The problem is that for a large model many of the 
rate constants and activation energies are not known. A model with many ad- 
justable parameters may describe the time evolution of a complex system with 
high accuracy. However, the model’s capability to make realistic predictions is 
generally blurred by the different sets of “acceptable” parameter values which can 
be obtained in the fitting process. 

To develop consistent models one can use Ockham’s razor [23], i.e., avoid un- 
necessary assumptions and complexities. Then, by gradually increasing the com- 
plexity of a model in accordance with the availability of new experimental findings, 
it should be possible to arrive at models which consistently represent an experi- 
mental system. In this respect, it will be interesting to see how our understanding 
of circadian rhythms and their models develop in the future. 
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1. Introduction 

Skeletal muscle converts the thermodynamic force of the non-equilibrium ATP 
/ ADP concentration ratio in the cytosol into mechanical force during contrac- 
tion^ [1]. As such, skeletal muscle function can be described using engineering 
concepts as a chemo-mechano transducer (Fig. 1). The molecular machinery in- 
volved in this conversion consists, amongst others, of filaments of act in and the 
motor protein myosin ATPase, and the calcium (Ca^“*“)-activated switch protein 
troponin [1]. The non-equilibrium cytosolic ATP/ ADP concentration potential is 
thermodynamically buffered by the cellular pool of mitochondria via oxidative 
ADP phosphorylation. Kinetically, this cytosolic potential is buffered on a fast 
(i.e. (sub)second) time scale by creatine kinase (CK) and glycolysis, and on a slow 
(i.e. minutes) timescale by oxidative phosphorylation [2]. Muscle contraction and 
the associated conversion of thermodynamic ATP energy force is under voluntary, 
neural control [1]. It is initiated at the cellular level by action potential-gated re- 
lease of Ca^“*~ ions from the sarcoplasmic reticulum (SR) stores into the myoplasm^. 

As a consequence, the control design of skeletal muscle function is hierarchi- 
cal: the signal initiating contraction originates at a supra-organ level. However, 
some degree of autonomous control - i.e. at the cellular level - of ATP energy force 
conversion is needed. This is necessary to prevent draining of the ATP energy ther- 
modynamic force beyond levels required for homeostasis of the intracellular milieu 

^This mechanical force is ultimately exerted on the skeleton to which the muscle is attached by 
tendons. 

^Contraction is terminated when the free Ca^"*” concentration returns to resting levels driven by 
re-sequestration of Ca^“*" by ATP free energy-driven pumps (SERCA) in the SR membrane. 
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Figure 1. Schematic of skeletal muscle function as a “chemo- 
mechano transducer” . The activity of the transducer is under neu- 
ral control. 



causing cell death. Indeed, empirically, ATP free energy demand during contrac- 
tion cannot outstrip supply on a sustained ba^sis: living muscle fatigues upon high 
intensity stimulation instead of going into rigor. Importantly, it will do so both in 
the intact animal as well as when excised and (su)perfused [3]. Clearly, a transition 
from a hierarchical to an autonomous control design of ATP thermodynamic force 
conversion must occur with increasing duty cycle of stimulation. The mechanistic 
basis for this transition was investigated. 



2. Methods 

2.1. Modeling of ATP Free Energy Metabolism in Contracting Muscle 

2.1.1. Modular ATPase network in contracting skeletal muscle. The minimal 
model that captures the main features of ATP free energy metabolism in con- 
tracting skeletal muscle is a modular branched pathway consisting of three ATPase 
modules: Ei (cellular pool of mitochondria), E 2 (total AM- ATPase) and E^ (total 
SR- ATPase) (Fig. 2 ). These modules consume or produce a common intermediate, 
S, that is related to the cytosolic ATP free energy at rates V] , v -2 and V 3 , respec- 
tively. We chose the cytosolic concentration ratio of ATP and ADP (ATP/ADP) 
for 5. For reasons discussed elsewhere [5] we were unable to use the AG at p ex- 
pression of the ATP free energy. Each module is treated as homogeneous; there is 
no diffusion limitation of S among the components at steady-state. 
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Figure 2. Modular network model of ATP free energy metabo- 
lism in contracting muscle. The boxes labeled 1-3 represent the 
three enzyme assemblies considered. Component 1 is the mito- 
chondria pool in muscle. Component 2 is the actomyosin array in 
the filaments. Component 3 is the sarcoplasmic reticulum Ca^“^ 
pump. The corresponding ATPatse fluxes are labeled Ji- J.^. The 
dashed arrow from box 3 to flux J2 indicates a second connectivity 
between AM- and SR-ATPase via [Ca^^Jcyto. 



ATP free energy metabolism in this network is characterized by four system 
variables: three fluxes Ji, J2 and J3 (units: moles per volume per time), and one 
concentration ratio. The model is defined without considerations of the magnitude 
or direction of the fluxes; that is, until the properties of the model are specified by 
analysis of the available kinetic data on the three components, the model is general. 
For this reason all three components (modules) in the network are termed ATPases 
even though it will be clear that mitochondria in the muscle function as a net 
synthase (despite its reversibility) and the AM and SR components function as net 
ATPases. It is furthermore important to note that in addition to the connectivity 
via the ATP free energy potential ATP / ADP, a second connectivity between rates 
V2 and V3 exists in the network via [Ca‘^~^]cyto (Fig- 2). This connectivity was 
explicitly considered in the analysis (see section 2.1.2. below). At steady-state, the 
relation between the three fluxes in the network is J\ = J 2 + J3, that is, only 
two of the three fluxes in this branched pathway are independent. Neural control 
of ATPase flux in the network occurs by Ca^+ regulation of the activity of AM- 
ATPase and SR-ATPase. Therefore fluxes J2 and J3 are designated as independent 
fluxes in the network. We define the variable a denoting the magnitude ratio of 
these branch fluxes, J 2 / In contracting muscle, AM- ATPase flux accounts for 
approximately 70% of total ATP utilization flux, with the remaining 30% mostly 
due to SR-ATPa.se activity [6]. Therefore, a = 2.3 for the network. It is a constant 
in our analysis as a first assumption. 




34 



J.A.L. Jeneson, H.V. Westerhoff and M.J. Kushmerick 



2.1.2. Steady-state kinetic relations within the ATPase network The following 
set of steady-state kinetic functions describe the dependence of the three enzyme 
modules in the network on the concentration (ratio) of ATP and ADP [5]: 
Mitochondria. The function that describes the dependence of mitochondrial ATP 
synthesis flux J\ on ATP/ ADP in human forearm flexor muscle is [5]: 






. f ATP! ADP \ ^ 

\ r^ATP/ADP I 



~{Jp) 



'p jmin 



( 1 ) 



is the ATP/ ADP ratio at half-maximal ATP synthesis flux and was 
set at 186 [7]. 

Sarcoplasmic Reticulum Ca^^ -ATPase. The function describing the SR-ATPeise 
forward rate dependence on ATP and ADP that was used was [5]: 
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where K^^atp is the affinity for the substrate, and and are ADP inhibition 
constants. ^ of SR- ATPase is at least two orders of magnitude lower than 

[ATP] in mammalian skeletal muscle (10 pM [8] versus [ATP] > 5 pM [1]), and so 
the term j [MgATP] is 0.01. The estimate for K^> that was used was 

0.5 pM [5]. 

Actomyosin ATPase. For this module, the following function describes the steady- 
state rate dependence of AM- ATPase on ATP and ADP, and [Ca?^]cyto [5]: 



'^2 — V2r, 



(iCa'^^jcytoy 



1 -h 



f<ATP 

[ATP] 



(l + lAm'^ {K,or + {{Ca^+lcytof 



(3) 



where ^ is the Michaelis constant for the substrate (10-20 pM for ATPase 

activity; [9]), Ki is the inhibition constant for MgADP (200-300 //M; [9]), and X50 
is the Ca^“^ concentration required for half-maximal stimulation of AM-ATPase 
[5]. The first term in Eq. (3) describes the connectivity of rate U2, and rates V\ 
and U3 in the network via the ATP and ADP concentrations. The second term 
allows explicit consideration of the connectivity of rates V 2 and U3 in the network 
via [CdL^~^]cyto (Fig- 2). Briefly, this relation was developed as follows (see [5] for a 
detailed description): 

First, we derived a relation between [Ca^“^]cyto attained following electrical 
stimulation and [Ca^"*"]^/^ recovered by SR- ATPase activity during the time in- 
terval between subsequent stimulations. At {— 1/stimulation frequency). Next, a 
quantitative relation between time interval At and [Ca^"^]^/^ recovery in muscle 
was derived using the reported kinetics of Ca?~^ removal from the cytosol follow- 
ing a Ca^"^ release pulse in mammalian muscle [10]. This relation was then used 
to compute an array of [Ca‘^~^]sR/{[Ca‘^~^]sR)max{^t) values for a 0-10 Hz range 
of stimulation frequencies. To correct for stimulation frequency-dependent ADP 
inhibition of SR- ATPase (see section 3 - Fig. 4) affecting [Ca^+]5/^ recovery, this 
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array was multiplied with the corresponding array of vs/Vmaxi^t) values calcu- 
lated using Eq. (2) and the measured (A^, [ADP]) covariation in skeletal muscle 
(see section 2.2.). The computed covariation {At, {[Co^~^]sR/{[Ca‘^~^]sR)max)) was 
then transformed using the relation between [Ca^"^]cyto and [Ca^+J^/^ derived in 
the first step, to {At, {[Ca‘^~^]cyto/ {[Ca‘^'^]cyto)max))‘ This covariation was used 
together with the computed {At, ATP/ ADP) covariation, to compute the covari- 
ation {At, V 2 ) using Eq. (3) for stimulation frequencies between 0.3 and 10 Hz. 
Finally, the thus computed (A^, V 2 ) array was correlated with {At, ATP/ ADP) 
and characterised by curve- fitting to yield the function [5]: 

V2(ATPIADP, [Ca^+]oyto) 

= 0 ^ 8 . . (. - ... (4^^)) + 0.06 . (1 - ... (^^)) (4) 

that describes AM-ATPase rate dependence on ATP, ADP and [Cd?'^]cyto (and so 
containing the V 2 and ^;3 connectivity via [Cd?^]cyto) exclusively in terms of the 
ATP/ ADP ratio. 

2.2. Experimental Methods 

2.2.1. 31P NMR spectroscopy Human forearm flexor muscle (n = 6, 5M/1F 
subjects, age 28-55 years) was studied during twitch contractions of the entire 
flexor muscle compartment evoked by external electrical stimulation of the ulnar 
and medial nerves at stimulation frequencies between 0.3 and 2.0 Hz (electric 
pulse duration 0.2 ms; amplitude 250-300 V [11]). '^^P NMR spectroscopic data 
were acquired at 2.0 T from resting and stimulated muscle with 8 second time 
resolution according to methods described in detail elsewhere [11, 12]. The 2 Hz 
range of stimulation frequencies was sufficiently broad to ensure that the maximal 
steady-state of oxidative ATP synthesis in forearm flexor muscle was attained in 
each subject studied. The endpoint of the sustainable steady states was determined 
by the achievement of a steady reduction in PCr without acidification to pH lower 
than 6.9. 

2.2.2. NMR data analysis Raw data were transferred to a Sparc II workstation 
(Sun Micros ystems) and analysed in three steps as described in detail elsewhere 
[12] to obtain high signal-to-noise ^^P NMRS recordings of PCr, Pi and ATP levels 
and intracellular pH (pH^) at steady-state for each stimulation frequency studied. 
Concentrations of PCr, Pi, and ADP at each steady-state were calculated assuming 
the concentration of ATP was 8.2 /xM and total creatine was 42.7 /xM [13], and 
assuming creatine kinase equilibration. Intracellular pH was estimated from the 
chemical shift difference between the PCr and P^ resonance [14]. 

2.2.3. Curve-fitting and statistical analyses Correlations of variables were anal- 
ysed by nonlinear curve-fitting using defined functions in Fig.P software (version 
6.0; Elsevier Biosoft, Cambridge, UK). 




36 



J.A.L. Jeneson, H.V. Westerhoff and M.J. Kushmerick 



2.3. Control Analysis 

Nine flux control coefficients three ATP/ADP concentration ratio control 

coefficients three ATP/ADP elasticity coefficients {^\tpiadp) 

describe each steady-state k of ATP free energy metabolism in our model (Fig. 2). 
Mathematical expressions for flux control and ATP/ADP ratio control coefficients 
in terms of the ATPase elasticities towards ATP/ADP {^\tp/adp) branch 

flux ratio a were previously derived in [5]. These solutions were developed on basis 
of the summation and connectivity theorems, and the branch theorems for flux 
control and concentration control of MCA [5]. The calculation of a complete set 
of 12 control coefficients at a particular steady-state k involved three steps: 

(i) Measurement of the steady-state ATP/ADP ratio in forearm flexor muscle 
using '^^P NMR spectroscopy at seven stimulation frequencies in the range 
of sustainable steady states (up to 2 Hz in experiments; see section 3). This 
(1/At, ATP/ADP) dataset was extrapolated to 10 Hz by curve- fitting. 

(ii) Computation of 97 sets of three ATP/ADP elasticities in the network at 
steady-state for a 10 Hz range of stimulation frequencies (0.3-10 Hz, step- 
size 0.1 Hz; 97 steady-states) using the (1/A^, ATP/ADP) data obtained 
in step (i), and Eqs. (1),(2) and (4) according to the relation [5]: 
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(iii) Computation of 97 sets of 12 control coefficients for flux- and concen- 
tration ratio control in the network at steady-state using the 97 sets of 
elcLSticities computed in step (ii), a branch flux ratio a of 2.3, and the 12 
p/ADP^^) relations previously described in [5]. 



2.4. Regulation Analysis 

Neural regulation of muscle contraction via was examined using the 

MCA concept of the Response Coefficient, defined as Rx = EC'f * 
where Y is a system variable and X is a system parameter or external effector [4] . 
Applied to [Csi^~^]cyto regulation of ATPase flux J 2 we obtain: 
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3. Results 

3.1. Dynamic Range of Steady-States of Oxidative ATP Metabolism in Forearm 
Muscle 

The maximal steady-state of oxidative ATP free energy metabolism in contracting 
human forearm flexor muscle was attained at a stimulation frequency of 1.6 Hz 
(see ref [5] for details). The ATP hydrolysis rate at this frequency was 0.15±0.01 
mol ATP/sec/g muscle (mean SE; n = 6 muscles) [5]. This rate constituted the 
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maximal ATPase flux that could be sustained in this muscle at pH^ T.OibO.l, and 
therefore represents the maximal Ji flux in the particular network model of ATP 
free energy metabolism under investigation attainable in this muscle. At higher 
stimulation frequencies (and associated ATP hydrolysis rates), non-oxidative ATP 
synthesis increased resulting in a decline of pH^ below 6.9 to values as low as 6.7. 
These conditions were not steady-states and were not analysed further. 
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Figure 3. Cytosolic ATP/ADP concentration ratio at steady- 
state in forearm flexor muscle as a function of stimulation fre- 
quency (in Hz). Bars (+SEM) represent average ratios measured 
by '^P NMR spectroscopy in six subjects except for 1.6 Hz {n—5) 
and 1.8 Hz {n=2) (see section 2.2). The solid line represents the 
monoexponential fit to the data. 



Fig. 3 shows the decline in ATP/ADP concentration ratio in the cytosol at 
steady-state of ATP free energy metabolism with increasing neural stimulation fre- 
quency calculated from '^P NMR spectroscopic measurements of steady-state PCr 
and ATP levels and pH^ as described in Experimental Methods. The ATP/ADP 
concentration ratio decreased from 314di8 (mean ibSE, n=6) at 0.3 Hz contractions 
to 98±1 (mean ±SE, n=2) at 1.8 Hz contractions (Fig. 3). In non-stimulated mus- 
cle, the ATP/ADP ratio was 445ib8 (mean SE, n=6). The nonlinear downward 
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trend of ATP/ADP over the studied 2 Hz range of neural stimulation frequen- 
cies was extrapolated to 10 Hz by curve fitting of a monoexponential function^ to 
the measured (1/A^, ATP/ADP) dataset. These extrapolated ATP/ADP ratios 
represent the steady-states that would be attained in the muscle had anaerobic 
glyco(geno) lysis remained insignificant so that the ATPase network under inves- 
tigation (Fig. 2) would continue to apply^ . This approach allowed us to proceed 
with our objective to investigate flux- and ATP/ADP concentration ratio control 
in this particular ATPase network as a function of neural stimulation frequency. 

3.2. Dynamic Range of Elasticities Towards ATP/ADP in the Network 

Fig. 4 shows the dynamic range of the elasticities towards ATP/ADP of the mito- 
chondria {^\tp/adp)^ AM-ATPase {^\tpiadp) SR-ATPase {^atp/adp) 
a 10 Hz range of neural stimulation frequencies computed from Eq. (5) as de- 
scribed in Methods. Note that the elasticity of mitochondria toward ATP/ADP, 
^ ATP/ADP^ is negative whereas that of AM-ATPase, ^\tp!Adp^ SR-ATPase, 
^ATP/ADP' positive because an increase of ATP/ADP reduces the rate of oxida- 
tive phosphorylation, and increases the rate of AM- and SR-ATPase, respectively 
(Eqs. (l)-(4)). 

At low neural stimulation frequencies (0.3 - 0.6 Hz), ^\rpjADP greater 
than two orders of magnitude higher than ^\'i pjADP ^atpiadp when 
^\'VPIADP decreases two- fold as the contraction frequency doubled (Fig. 4). This 
result means that the functions of the AM- and SR ATPase modules are immune 
from changes in ATP/ADP in this stimulation frequency range. In fact, it can be 
argued that is effectively zero over this stimulation duty cycle domain 

[5] because of timescale differences between Ca^“^ pumping, cross-bridge cycling 
and oxidative phosphorylation [1, 5, 9, 10]. As neural stimulation is increased to 
frequencies in the 1-2 Hz range, decreases a further five- fold to the 

same order of magnitude as ^^ppjADP doubles over this frequency range; 

both are higher than ^atp! adp-^ elasticity too rapidly approaches similar 

values towards the high end of this stimulation frequency domain (1.6 - 2 Hz) 
(Fig. 4). Increasing the neural stimulation frequency past 2 Hz shows 
dropping sharply to values approaching zero (Fig. 4). Just the opposite is found 
for ^\ppj^pp that continually increases to even higher values than ^\'ppjj^pp 
(Fig. 4). These results mean that over this neural stimulation frequency range, 
in the absence of any anaerobic glyco(geno) lytic ATP synthase module buffering 

'^The criteria for this function were nonlinearity and saturability (because of biochemical con- 
straints). Other functions were tested but found to not influence the trend of change in ATP/ADP, 
only the amplitude. 

"^In the studied muscle, the ATPase network that was active at neutral stimulation frequencies 
>2Hz was composed of four ATPase modules - i.e. modules 1-3 from Fig. 2 plus a second, 
anaerobic ATP synthesis module. Because of this, any measured ATP/ADP ratio under these 
conditions could not be used to study the behaviour of the particular ATPase network under 
investigation. Therefore the reported approach was taken. 
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Figure 4. Covariation of the computed elasticities towards 
ATP/ADP of mitochondria (solid line), AM-ATPaise (dashed line) 
and SR-ATPase (solid line) and the stimulation frequency (in 
Hz; Log scale). Elasticities were computed on basis of the mea- 
sured/extrapolated ATP/ADP ratio at each frequency as de- 
scribed in section 2.3. The elasticity of mitochondria was scaled 
by a factor of 2 for representation. 



ATP/ADP, function of SR-ATPase is compromised because of significant ADP 
inhibition (and time constraints; see Discussion) , Because of the V 2 — connec- 
tivity in the network via [Cs?^]cyto^ this effectively causes very significant sensi- 
bilisation of AM- ATPase to the ATP free energy potential. Mitochondria, on the 
other hand, have lost any sensitivity to this potential because they are operating 
at near-maximal activity [5, 7]. 

3.3. Kinetic Controls of ATP Free Energy Metabolism in Contracting Forearm 
Muscle 

Fig. 5 shows the computed distribution of kinetic control of the ATP/ADP concen- 
tration ratio control in the ATPase network of Fig. 2 as a function of stimulation 
frequency. Values of the ATP/ADP control coefficient for the cellular pool of mi- 
tochondria are positive whereas those for AM- ATPase 

and SR-ATPase are negative because an increase in the activity of 
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stimulation freq (Hz) 



Figure Covariation of the computed ATP/ADP concentra- 
tion ratio control coefficients of mitochondria (top line), AM- 
ATPase (bottom line) and SR-ATPase (middle line) and the stim- 
ulation frequency (in Hz; Log scale). Control coefficients were 
computed on basis of the elasticities towards ATP/ADP shown 
in Fig. 3 and a constant branch flux ratio of 2.3 as described in 
section 2.3. 



mitochondria will increase the ATP/ADP ratio in the cytosol, while increased ac- 
tivities of AM- and SR,-ATPase will decrease this potential. Control over the ATP 
free energy potential in the network is shared between all three ATPase modules 
over the 10 Hz range of stimulation frequencies analysed, but dominated by the 
activities of mitochondria and AM- ATPase. Since our investigation focused on the 
control structure of the chemo-mechano force conversion in muscle, the remainder 
of this analysis will focus on the kinetic controls of the actomyosin- and mitochon- 
drial ATPase fluxes in the network. 

Figure 6 shows the computed distribution of kinetic control of mitochondrial 
ATP synthesis flux ( Ji ) in the ATPase network of Fig. 2 as a function of stimulation 
frequency. At low stimulation frequencies, kinetic control of ATP synthesis flux Ji 
resides completely in the activity of the ATP-consuming modules -f C-p > 
0.95; 0.3 — 1.0 Hz) (Fig. 6). As the stimulation frequency doubles from 1.0 to 2.0 
Hz, however, kinetic control of flux JI increasingly shifts mostly from AM- ATPase 
towards mitochondria. At 3 Hz, and C/ ^ are equal, and, at 10 Hz, the initial 
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control distribution has completely inverted from a demand- to a supply control 
distribution = 0.1; = 0.9) (Fig. 6). 



1 n 





0.1 05 1 2 5 10 

stimulation freq (Hz) 



Figure 6. Covariation of the computed mitochondrial ATPase 
flux J[ control coefficients of mitochondria, AM- ATPase and SR- 
ATPase, and the stimulation frequency (in Hz; Log scale). Flux 
Ji control coefficients were computed on basis of the elasticities 
towards ATP/ADP shown in Fig. 3 and a constant branch flux 
ratio of 2.3 as described in section 2.3. 



Figure 7 shows the computed distribution of kinetic control of AM- ATPase 
flux J ‘2 in the ATPase network of Fig. 2 as a function of stimulation frequency. 
At low stimulation frequencies, kinetic control of flux J 2 resides completely in the 
activity of the AM- ATPase {Cp = 1.0, Cp = Cp = 0; 0.3 - 1.0 Hz) (Fig. 7). As 
the stimulation frequency doubles from 1.0 to 2.0 Hz, however, kinetic control of 
flux J2 begins to shift from AM- ATPase towards mitochondria. At 3 Hz, and 
Cp are equal, and, at 10 Hz, mitochondrial activity has dominant kinetic control 
of AM- ATPase flux J 2 {Cp = 0.95, Cp = 0.35) (Fig. 7). 



4. Discussion 

The main finding of our study was that kinetic control of ATP thermodynamic 
force conversion in the network analysed here exhibits a transition from AM- 
ATPase to mitochondria with increasing duty cycle of neural stimulation (Figs. 
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Figure 7. Covariation of the computed actomyosin ATPase flux 
(J 2 ) control coefficients of mitochondria, AM- ATPase and SR- 
ATPase, and the stimulation frequency (in Hz; Log scale). Flux 
J ‘2 control coefficients were computed on basis of the elasticities 
towards ATP/ADP shown in Fig. 3 and a constant branch flux 
ratio of 2.3 as described in section 2.3. 

6 and 7). Here, we will discuss how this result corresponds to a transition from 
a hierarchical to an autonomous control design, and what general principle in 
physiological control design of complex biological systems lies at the basis of this 
behaviour of this particular metabolic network. For a detailed discussion of other 
aspects of the analysis and its results, the reader is referred to a previous paper 
[5], 

The first question we will examine is whether the dynamics of the control 
structure of ATPase flux control in this network described by Figs. 6 and 7 agree 
with the known behaviour of skeletal muscle function in the living organism. That 
is, strict neural regulation of mechanical force output over a large range of work- 
loads via variation of the stimulation frequency and duty cycle [1] that only breaks 
down at high intensity stimulation where fatigue sets in [1, 3]. The latter is gener- 
ally associated with large anaerobic ATP synthesis flux indicative of mitochondrial 
ATP synthesis flux operating near its Vmax [1, 3]. To answer this question, we must 
first establish what system properties are necessary for hierarchical, neural control 
of muscle function. Neural regulation of mechanical output of muscle is imple- 
mented at the cellular level by [Ca^'^Jcyto regulation of contraction. Hence, hier- 
archical neural control of muscle function requires first and foremost that enzyme 
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(-module)s that are highly sensitive to [Cd?'^]cyto changes have high kinetic control 
over actomyosin cross-bridge cycling and the associated ATPase flux (=flux J 2 in 
our network; Fig. 2) and vice-versa. We must therefore examine the [Cd?^]cyto 
regulability of flux J 2 in our network using the results of the analysis of kinetic 
control of flux J 2 (Fig. 7) and Eq. (6) (section 2.4.). Fig. 7 shows that in the 
low stimulation frequency range (0.3-1 Hz), R'^^ 2 -^ is determined only by the AM- 
ATPase term, because == 0, and = 1. Taking = 3, 

corresponding to the kinetic order of the [Ca?'^]cyto dependence of AM- ATPase 
rate Eq. (3)), ^^^ 2 + = 3 over this stimulation frequency range. As the stimulation 
frequency is further increased to 2 Hz, decreases to 0.9, increases to 0.1 
and to -0.05 ^ Under these conditions, = 3 and = 2 corresponding 
to the kinetic order of the [Csi^~^]cyto dependences [6], while = 0 in a predom- 
inantly fast- twitch muscle for this stimulation frequency range. Hence R'^^ 2 + in 
this 1-2 Hz range is likewise dominated by the AM- ATPase term, and 

equal to 2.6. Taken together, these results mean that forearm flexor muscle func- 
tion can indeed be hierarchically regulated over a stimulation frequency range up 
to 2 Hz. Mechanical output in this stimulation frequency range is near-completely 
proportional to the activity of AM- ATPase which is in turn regulated by [Ca^“^]cyio 
according to 3^’^ order kinetics. When the stimulation frequency is increased above 
2 Hz, however, neural regulability of mechanical output via AM-ATPase activity 
rapidly diminishes because drops sharply, approaching 0.5 at 5 Hz (Fig. 7). As 
a consequence, the AM-ATPase term of Rq^^ 2 + is maximally only 1.5 at 5 Hz - i.e. 
twofold lower than its value at low stimulation frequencies. This means that 
= 3 will only be attained if the sum of the mitochondrial and SR- ATPase terms 
is 1.5 or higher. At 5 Hz, has increased to 0.65. Since remains relatively 
small (-0.2), a two-fold increase of Vmax of mitochondria due to [Ca?~^]cyto effects 
[15] would be required. Such dramatic kinetic effects of high intensity stimulation 
have not been experimentally found (16). A more realistic estimate of under 
these conditions would be 0.1 [16]. Consequently, R'^^ 2 + at 5 Hz is between 1.5 
and 2, and decreases even further at higher stimulation frequencies. This means 
that in this stimulation frequency range, hierarchical, neural regulation of forearm 
muscle function by simple [Cs?~^]cyto frequency encoding breaks down. Control of 
mechanical output has distributed over all three modules, with dominant control 
increasingly in the activity of mitochondria. And since this activity is much less 
regulable by [Ca^“^] eventually the system will become fully autonomous. 

The functional consequence of this system regulation design is that neural or 
electrical stimulation of muscle cannot deplete ATP free energy. We conclude from 
our analysis, therefore, that this well known behaviour of skeletal muscle [1, 3] 
can be entirely explained by the behavioural properties of the ATPa.se network 



^The negative values of reflects the negative control over this flux exerted by SR- ATPase 
via its effect on cell energetics - i.e. an increase in vs causes a decrease of ATP/ADP[5]. 
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analysed; other mechanisms are not needed even though they may be present and 
functionally active. 

Central to this operational design of muscle function is the network property 
that AM-ATPase flux control at low stimulation frequencies is not shared. The ac- 
tivity of AM-ATPase under these conditions could as such be described in classical 
kinetic control terms of the ‘rate-limiting step’. It is of interest, and we believe of 
significance, that this finding is at odds with the general notion that the field has 
taken away from control analysis of metabolic networks - i.e. that control is gener- 
ally distributed and variant from one steady-state to the next, and therefore that 
the notion of a rate-limiting step is no longer a relevant biochemical concept (e.g. 
[17]). While this may be true for isolated metabolic networks, our present work and 
that of others (e.g. [18, 19]) demonstrates that in the living, functional cell, where 
there is net flux driven or pulled through the pathway associated with cell activity, 
a ‘rate limiting step’ in kinetic control of this flux is often found. Our analysis of 
the regulation of muscle contraction above illustrates why this particular control 
design is physiologically necessary and even essential. Because regulability of flux, 
one of the conditions of organism viability, is lost when flux control is distributed. 
We therefore propose the concept of “physiological rate control” to describe this 
specific flux control design in living cells (as opposed to distributed, ‘biochemical’ 
rate control). 

Finally, a general physiological design principle in biological systems known 
as time hierarchy or time scale separation [4] lies at basis of this particular AT- 
Pase flux control structure in muscle, both regarding the physiological rate control 
by AM-ATPase at low neural stimulation rates as well as the transition to dis- 
tributed control at higher frequencies. Time scale separation results from the fact 
that biological systems involve simultaneously many reactions which take place 
with different velocities - e.g. the different timescales of contraction and gene ex- 
pression in muscle. As a result, the description of such a system using a set of 
ode’s can be simplified by selection of a particular timeconstant domain. This 
essentially filters out all reactions that are much faster or slower and can there- 
fore be assumed in (quasi-) steady-state, and therefore have no control over the 
system dynamics that are investigated [4]. Here, a similar principle applies. The 
time scales of [Cdi^~^]cyto and ATP/ADP changes in muscle at low stimulation fre- 
quencies are separated by three orders of magnitude (milliseconds versus seconds, 
respectively) [1, 5]. As a consequence, AM-ATPase activity in the network at the 
“ATP/ADP timescale” is under these conditions only affected by the elasticity to- 
wards ATP/ADP. Largely because this elasticity is very small in this ATP/ADP 
concentration range (Fig. 4), kinetic control of AM-ATPase flux resides in this 
module. When the stimulation frequency approaches 2 Hz, however, [Cd?^]cyto 
homeostasis by SR-ATPase is increasingly compromised due to both ADP inhibi- 
tion of and time constraints. As a result, timescale separation of [Cdl^^]cyto and 
ATP/ADP changes is lost and effective sensibilisation of AM-ATPase to the ATP 
free energy potential occurs because of the V 2 — t ’3 connectivity in the network 
via [Cd?^]cyto’ This together with the rapidly decreasing ATP/ADP sensitivity 
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of mitochondria in this frequency range causes the shift of contractile ATPase 
flux control from actomyosin to mitochondria. The adaptive response of skeletal 
muscle in CK knock-out mice to overexpress SR- ATPase [20] can in this light 
be explained as an attempt to re-establish timescale separation of [Ca^*^] and 
ATP/ADP changes, at least for low neural stimulation duty cycles, and thereby 
achieve some regulability of muscle function that would otherwise be lost in this 
genotype. 

In summary, this work proposes that the concepts of “physiological rate con- 
trol design” and time hierarchy in biological systems underlie neural regulability 
of mechanical output and ATP free energy homeostasis in skeletal muscle. 
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1. Introduction 

With the developments in genomics there has been an increasing focus on the 
behaviour of complete biological systems and this has resulted in the development 
of Systems Biology as a new research field in biology [23]. Biological data from 
all levels of metabolism, such as genome, transcriptome, proteome, metabolome, 
any of the interactomes (protein- protein, protein-DNA, protein-mRNA, etc.) as 
well as the fluxome [25] are to be integrated in order to view a cell, an organism 
or even a population as a whole rather than investigating the single components 
of the system (Fig. 1). In order to integrate the wealth of information at the 
different levels of the metabolism, mathematical models play an important role, 
and Systems Biology is therefore often associated with quantitative investigation 
of the biological system under study. Much information on individual components, 
or sub-systems, in living cells has been obtained during the 20th century, but with 
tools such as DNA arrays, proteomics, metabolite profiling, it is now possible to 
analyse all the components in the system at the same time, thereby enabling a 
move from reductionist approaches to a global or a system approach [28]. 

This new view on biology was triggered by the large sequencing efforts over 
the past years, that have resulted in genome sequence information on currently 
more than 500 hundred species. As stated by the GOLD Database [2] in July 2002, 
almost 100 genomes have been fully sequenced, and there are more than 300 ongo- 
ing prokaryotic and more than 200 ongoing eukaryotic genome sequencing projects. 
In order to harness the information content in the sequenced genomes it is essential 
to develop high-throughput analytical techniques as well as powerful mathemat- 
ical tools that enable integration of the large data sets. The integrated approach 
taken in Systems Biology will play a key role in investigating and understanding 
the genetic information and the overall operation of cell function. 

Systems Biology will be supported by high-throughput technology and robot- 
ics that will allow rapid generation of data at almost all levels of the metabolism. 
Some high throughput technologies have been developed for genomics, proteomics 
and transcriptomics, whereas there are currently no available high through-put 
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like techniques for analysis of the complete metabolome, and here novel technolo- 
gies are currently being developed that allow an efficient screening of the (low 
molecular weight) metabolome. Even though recent developments show that it is 
possible to screen for up to 300 metabolites or more in a single experiment [16, 31], 
the different structure and different chemical properties of the many intracellular 
metabolites require different analytical techniques. However, metabolite profiling 
will allow exact quantification of a metabolic state. Therefore, the identification 
of the function of orphan genes will be facilitated, and the metabolome is likely to 
give new insight into the function and control of metabolism. Hence, metabolome 
analysis should be considered as an integrative part of Systems Biology. 

The vast amount of biological information is usually maintained in compre- 
hensive databases, accessible via internet, and of which almost 300 have just been 
summarised [Ij. This information will be used as a basis for mathematical mod- 
elling and will build a platform for the development of whole cell models for the 
prediction of phenotypic behaviour. Modelling of whole cell behaviour is going to 
be facilitated by a tremendous increase in computer power [44, 11, 28], and the 
establishment of fully mechanistic models will also require a large effort in deter- 
mining various kinetic parameters. To this date, metabolic models are therefore 
first generation models where many simplifications are introduced [20]. 

Within both metabolic engineering and functional genomics, the application 
of a macroscopic view will facilitate direct genetic modifications and the assignment 
of function to orphan genes, respectively [25]. Recently, a complete library of single 
gene deletion strains of Saccharornyces cerevisiae was completed [48] and large- 
scale phenotypic and functional investigations are being conducted [12, 27]. After 
the finalisation of the sequencing of the human genome (International Human 
Genome Sequencing Consortium, 2001), focus has clearly moved to the protein 
level, and tremendous effort is currently undertaken to identify the function of all 
human orphan genes [26]. 

The present chapter intends to focus on recent developments in mathematical 
modelling of metabolic networks (explicitly stoichiometric models) and metabolite 
profiling. For the section on mathematical modelling emphasis will be put on 
models based on stoichiometry of metabolic reactions, as this represent an efficient 
tool to combine genetic information with the flux through a metabolic network on 
a genome-scale. 



2. Stoichiometric Modelling of Metabolic Networks 

Several mathematical methods have been developed to analyse metabolic networks. 
One of them is stoichiometric modelling. Stoichiometric models are solely based 
on the time invariant properties of metabolic networks as opposed to kinetic mod- 
els that include both, stoichiometric and kinetic information [20]. Stoich iometric 
models are simple and comparably easy to build, however, since no regulatory 
and kinetic information is included, it is only possible to model (pseudo-) steady 
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Figure 1 . Systems Biology aims at the integration of biological 
data from any level of metabolism, such as genome, proteome, 
transcriptome, metabolome, as well as the interactome in order 
to quantitatively describe the metabolic behaviour. The metabolic 
behaviour of a living cell is highly infiuenced by metabolic regula- 
tion and the interaction among and between any of the metabolic 
levels. Through mathematical modelling, the different layers of 
regulation may be revealed, and it may hereby be possible to 
find relations between the different levels in the overall metabo- 
lism, e.g. between the fiuxome and the genome and between the 
metabolome and the genome. 



state metabolic behaviour. Still, their simplicity makes stoichiometric models at- 
tractive compared to kinetic models. The stoichiometry of a metabolic reaction is 
usually known, whereas detailed kinetic information is only available for a frac- 
tion of all known metabolic reactions. Furthermore, stoichiometric models allow a 
straightforward construction of genome-scale metabolic whole-cell models [7]. 

Stoichiometric models are built by setting up mass balances around all in- 
tracellular metabolites, which leads to the stoichiometric matrix where the 
element G[j contains the stoichiometric coefficient of metabolite i in the reac- 
tion. With the assumption of (pseudo) steady state, the following relation can be 
derived and used for the determination of the metabolic fiux vector v [30, 40]: 

V = 0 (1) 

Usually, this equation system is underdetermined, that is, the number of 
mass balances around intracellular fiuxes is smaller than the number of fiuxes 
that need to be calculated and additional information or assumptions are required 
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to determine the fluxes. Currently, there are flve different methodologies that are 
commonly applied to investigate the structural properties of stoichiometric models 
and to predict or simulate metabolic behaviour: Metabolic Flux Analysis, Meta- 
bolic Network Analysis, Flux Balance Analysis, Extreme Pathway Analysis and 
Elementary Flux Mode Analysis, and these will be discussed further below. The 
former three methodologies allow the calculation of a particular solution, whereas 
the latter two may allow the screening for a number of different flux distributions 
or even the computation of all theoretical possible pathways within the defined 
metabolic network (Fig. 2). 



Stoichiometric Model 



AnnoUted Genome 
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(FBA) 
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Figure 2. Mathematical modelling of stoichiometric metabolic 
networks. From annotated genome information, biochemical text- 
books, recent publications and reviews, and from pathway 
databases, information on the presence and the stoichiometry of 
metabolic reactions may be extracted, and a stoichiometric model, 
here represented by the stoichiometric matrix , can be set up. 
In a next step, different methodologies may be applied to investi- 
gate the metabolic behaviour. The most recent and currently most 
common approaches are Metabolic Flux Analysis (MFA), Meta- 
bolic Network Analysis (MNA), Flux Balance Analysis (FBA), 
Extreme Pathway Analysis and Elementary Flux Mode Analy- 
sis (see text for details). Stoichiometric models that include all 
known metabolic reactions are referred to as genome-scale meta- 
bolic models or metabolic whole cell models. 
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2.1. Genome Scale Metabolic Models 

Stoichiometric models that include all known metabolic reactions, referred to as 
genome-scale metabolic models or metabolic whole cell models, enable the estab- 
lishment of direct links between the genome and metabolic fluxes. From annotated 
genome information, biochemical textbooks, recent publications and reviews, and 
pathway databases, it is possible to extract information on the presence, the stoi- 
chiometry and the reversibility of metabolic reactions, and a stoichiometric model 
may thus be constructed (Fig. 2). Genome-scale metabolic models have been pub- 
lished for the prokaryotes Escherichia coli, Haemophilus influenzae and Helicobac- 
ter pylori [33] as well as for the eukaryote Saccharomyces cerevisiae [17, 13]. The 
organisms were investigated under different physiological and genetic conditions 
using Flux Balance Analysis or Extreme Pathway Analysis, and when compared 
to experimental data high agreement between experimental results and in silico 
computation was found [13, 8]. Currently, there are also several projects aim- 
ing at modelling whole-cell behaviour, including interaction (e.g. protein-protein, 
protein-DNA) and regulatory information [45, 44, 22]. 

2.2. Metabolic Flux Analysis (MFA) 

In Metabolic Flux Analysis a number of exchange fluxes are measured to render 
a determined equation system, thus enabling the calculation of intracellular fluxes 
and identification of pathway activity [40]. Typically, small models with 50 to 100 
reactions, and a low degree of freedom are used. There are numerous examples, 
where MFA has successfully been applied to investigate the metabolism of micro- 
bial as well as animal cells by comparison of flux distribution of different mutants 
to a reference strains under different physiological and genetic conditions [20]. 

2.3. Metabolic Network Analysis (MNA) 

In Metabolic Network Analysis, labelling experiments are used to identify the path- 
way topology and to quantify the fluxes. Here, the cells are fed with ^'^C-labelled 
substrates and the ^'^C-enrichment in the different carbon atoms of intracellular 
metabolites are measured using NMR or GC/MS [42, 6]. The enrichment patterns 
of the individual carbon atoms give unique information about the activity of the 
different pathways operating. Besides pathway identification (as in MFA), this 
approach also enables investigation of the activity of parallel pathways, compart- 
mentation, futile cycling and metabolic channelling [6, 47] 

2.4. Flux Balance Analysis (FBA) 

Often, it is not possible to determine an appropriate number of experimental con- 
straints to determine the metabolic fluxes in the model. In this case, a possibility 
to investigate the metabolism is to formulate and extend the stoichiometric model 
with an objective function, such as the optimisation of growth or the optimisa- 
tion of metabolite production [14, 9]. The resulting optimisation problem can be 
efficiently solved using linear programming, see for instance [3], and an optimal 
flux distribution may be calculated corresponding for example to optimal growth 
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rate or optimal biomass yield. Flux Balance Analysis has recently been applied 
to genome-scale metabolic models including several hundred reactions and mass 
balances around metabolites, for a number of prokaryotes and for Saccharomyces 
cerevisiae. For E. col% the effect of single gene deletions in the central carbon 
metabolism were computed on different media, and in 86 of all investigated cases 
(68 out of 79), the computed results were in agreement with previously published 
experimental results [8]. Similar results were found for the central carbon metab- 
olism in Saccharomyces cerevisiae [13], and here growth on a synthetic complete 
medium was simulated for 599 single gene deletion mutants and the computed 
results were consistent with experimental results in 87 % of the cases (523 out 
of 599) [18]. Heijnen and co-workers [41] used a stoichiometric model of Saccha- 
romyces cerevisiae and simulated the biomass yield for a number of single gene 
deletion mutants of the glyoxylate cycle and gluconeogenesis finding quantitative 
agreement with experimental data for different feed ratios of ethanol and glucose 
in 3 out of 4 investigated single gene deletion mutants. Maranas and co-workers 
[5] applied mixed integer linear programming in order to investigate the effect 
of foreign reaction insertions from a set of 3400 reactions on amino acid produc- 
tion and suggested a number of genetic modifications that may show higher yield. 
The linear optimisation problems in FBA often exhibit multiple optimal solutions. 
This means that an infinite number of flux distribution exists that lead to the 
same optimal objective value, e.g. optimal growth or optimal metabolite produc- 
tion. Applying mixed integer linear programming, Lee et al [24] have developed 
a methodology to calculate all so called alternate optima in linear programming 
models (referring to a set of optimal pathways). 



2.5. Elementary Flux Mode Analysis and Extreme Pathway Analysis 

In contrast to the first three approaches, where a particular solution is obtained. 
Elementary Flux Mode Analysis and Extreme Pathway Analysis provide a charac- 
terization of all possible flux distributions fulfilling the stoichiometric constraints. 
As an example, when investigating conversion of a substrate to biomass or to a 
specific metabolite, it is not only one optimal route that is found, but also equally 
optimal or suboptimal alternatives. Elementary Flux Mode Analysis and Extreme 
Pathway Analysis also suggest a new definition of pathways as compared to tra- 
ditionally classified pathways, such as glycolysis, pentose phosphate pathway, etc. 

Both approaches use convex analysis in order to calculate a unique set of 
pathways. In Extreme Pathway Analysis a set of positively linear independent 
pathways mathematically equivalent to the edges of a convex solution cone are 
calculated [35] , while Elementary Flux Mode Analysis also includes some positively 
linear dependent pathways via a simplicity constraint and this allows the direct 
calculation of all theoretical possible pathways within a metabolic network [37, 39]. 
Whereas Extreme Pathways could be said to be mathematically more fundamental, 
it is sometimes easier to interpret the Elementary Flux Modes from a biochemical 
perspective. 
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Elementary fiux mode analysis was successfully applied for the improvement 
of product yield and for functional reconstruction from genomic data [36]. Liao 
and co-workers verified experimentally the ability of this approach to predict meta- 
bolic behaviour due to genetic modifications by increasing the yield of aromatic 
metabolites on carbohydrates in E. coli. The fiexibility and the robustness of a 
genome-scale metabolic network of H. influenzae were investigated using Extreme 
Pathway Analysis [34]. Since the application of Elementary Flux Mode Analy- 
sis and Extreme Pathway Analysis for large metabolic networks often leads to a 
combinatorial explosion in the number of calculated pathways, the genome-scale 
metabolic network of H. influenzae was divided into seven subsystems, which al- 
lowed the application of Extreme Pathway Analysis within each subsystem [34]. 
Schuster and co-workers recently released the software “Separator” , which enables 
a systematic division of metabolic networks into subsystems, and investigated the 
metabolic network of Mycoplasma pneumoniae [38]. 



3. Metabolite Profiling 

Within transcriptomics and proteomics, high throughput analysis methods are 
well established (e.g. genome- wide monitoring of all mRNA levels using DNA 
chip technology and analysis of up to several thousand protein levels using 2D 
gel electrophoresis). The field of metabolomics, however, is less mature. Although 
metabolite profiling has long been applied for diagnosis of human diseases, e.g. 
detection of urinary organic acids using gas chromatography [43], it is not until 
recently that increasing effort is undertaken to develop methods for screening of 
a high number of intracellular metabolites [15], e.g. for the purpose of identifiy- 
ing the function of orphan genes [46]. For Arabidopsis thaliana [16] and potato 
tuber [32], gas chromatography- mass spectrometry was applied and enabled the 
simultaneous detection of more than 300 and 150 intracellular metabolites, re- 
spectively, such as sugars, sugar phosphates, amino acids and organic acids. Using 
such data multivariate data analysis may be applied to investigate and compare 
metabolite profiles in different mutants or for growth at different conditions. Fiehn 
et al. used principal component analysis and were able to discriminate different 
genotypes into clusters and to assign metabolic phenotypes. Oliver and co-workers 
used NMR to generate metabolite profiles of a number of S. cerevisiae mutants 
[29]. Using principal component analysis and discriminant function analysis, they 
found that deletion mutants having lost similar function group into the same clus- 
ter. They propose that this may enable the assignment of function to orphan genes 
when strains that were deleted in an orphan gene are compared to strains deleted 
in known genes. 

An advantage of metabolite profiling is that it will allow exact quantification 
of intracellular and extracellular metabolite concentration using for example gas 
and liquid chromatography combined with mass spectrometry. In contrast, expres- 
sion level analysis using for instance DNA chip analysis only allows a qualitative 
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analysis of a change in metabolism, when investigating different genetic or physi- 
ological environments. Hence, metabolite profiling may allow a better assignment 
of exact phenotypes and this will support and facilitate the assignment of function 
to orphan genes. 



4. Combining Metabolite Profiling and Mathematical Modelling 

As a complement to using multivariate data analysis for the elucidation of the 
function of an orphan gene, it would be attractive to combine metabolite profiling 
with mathematical models of metabolic networks. In this way, one could benefit 
from and exploit the metabolic and physiological knowledge represented in the 
model. A theoretical framework was recently proposed that may allow the use of 
a combination of metabolite profiling and mathematical modelling [19]. Here, a 
genome-scale metabolic model, needs to be designed, that attempts to include all 
known metabolic reactions. Comparison to the annotated genome sequence will 
allow the identification of reactions for which no corresponding gene exists. For 
example, for S. cerevisiae a genome-scale metabolic model was designed compris- 
ing 1175 reactions (Forster et al., [17]). Among these reactions, 140 reactions were 
included based on biochemical evidence, but with no corresponding gene. Some 
50% of these reactions are involved in amino acid metabolism, nucleotide metab- 
olism, lipid metabolism, etc. The theoretical framework aims at identifying the 
corresponding orphan genes of those reactions. Once a genome-scale metabolic 
model is set up, all theoretical possible pathways within a metabolic network can 
be calculated using Elementary Flux Mode Analysis [37]. Of the theoretical pos- 
sible pathways, the active pathways may be identified by conducting metabolite 
profiling. By comparing the metabolite profiles and the corresponding active path- 
ways of the reference strain to a single gene deletion mutant, a shift in the active 
pathway structure can be identified. Using the information of the active path- 
way structures one may track down the reactions and pathways that have caused 
the change in the metabolite profile, and thus allowing the identification of the 
function or functional class of an orphan genes. 



5. Conclusion 

Within Systems Biology, mathematical modelling and metabolite profiling will 
support and facilitate the analysis of metabolic behaviour. Particularly, metabolic 
engineering and functional genomics will clearly profit from recent advances in 
mathematical modelling and metabolite profiling. Elementary Flux Mode Anal- 
ysis and Flux Balance Analysis have successfully been applied to predict meta- 
bolic behaviour under different physiological and genetic conditions, and this will 
significantly support an efficient design of metabolically engineered strains with 
improved properties. The predictability will in the future be improved including 
regulatory constraints using metabolite profiles as well as expression data. 
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Metabolite profiling is currently a research area that undergoes tremen- 
dous efforts to enable a simultaneous quantitative detection of several hundred 
of metabolites or even of the full (low molecular weight) metabolome. In the near 
future, metabolite profiling combined with mathematical modelling will become a 
routine procedure, further accelerating progress within Systems Biology. 
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When a hormone or growth factor binds to a receptor, a multitude of pro- 
cesses is set in motion leading to coordinated changes in enzyme activities and 
gene expression in the cell. The steps involved in intracellular signal transduction 
include the activation of receptors, the mobilization of second messengers such as 
cyclic AMP and calcium ions, and the activation or inhibition of protein kinases 
and phosphatases. The complexity of signal transduction networks is illustrated by 
the estimated numbers of about 1000 protein kinases and 500 protein phosphatases 
encoded in the human genome.^ 

For many extracellular signals, major components of the corresponding in- 
tracellular pathways have been identified. For example, hormones such as nora- 
drenaline elicit regular oscillations in the cytoplasmic concentration of calcium in 
their target cells. These calcium signals can be measured with high temporal res- 
olution in single cells, and the channels and pumps that mediate the calcium con- 
centration changes have been characterized. Calcium, among many other signals, 
regulates the activities of protein kinases and phosphatases, and, again, major 
constitutents of protein phosphorylation cascades have been identified. Data on 
their kinetic behaviour are also becoming available. 

The information on the molecular components allows the construction of 
mathematical models of signalling networks. Mathematical modelling can address 
several, frequently overlapping, questions. First, although in many instances the 
major molecules of a pathway are known, the mechanisms of their interaction 
are rarely fully understood, and often alternative schemes have been proposed. 
By analysing and simulating mathematical models the consistency of a proposed 
mechanism with the data can be cissessed rigorously. Moreover, modelling may 
help to design experiments that can discriminate between hypotheses and estab- 
lish altogether new hypotheses. Second, a mathematical model provides a coherent 
representation of the experimental data for a particular system and allows to anal- 
yse the impact of all molecular components on the system’s behaviour. In this way, 
modelling can guide the acquisition of experimental data needed to characterize the 
system’s dynamics and regulatory properties. Third, the quantitative predictions 
made by mathematical modelling may form a basis for the rational manipulation 
of signalling networks for therapeutic and biotechnological purposes. 

When compared to the modelling of biochemical systems in metabolism, 
the modelling of signal transduction networks poses particular problems. There 
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is a much larger array of classes of molecular components, including receptors 
(with and without enzymatic activities), various enzymes (besides protein kinases 
and phosphatases also acetyltransferases and deacetylases and methylases), sec- 
ond messengers, trimeric and monomeric G-proteins, and scaffolds. Pathways are 
not constituted by continuous chemical modifications of molecules as in metabo- 
lism, but rather by a signal flow through successive reversible transformations of 
components between inactive and active forms. As signalling pathways are mainly 
determined by protein-protein interactions, there are few chemical constraints, 
allowing for complex network connectivities. Signalling processes are frequently 
transient in nature, and cellular responses depend on signal propagation time, du- 
ration and amplification. Theoretical approaches are being developed to concisely 
define and investigate these properties for transient states. In more complex sig- 
nalling networks, instabilities of steady states may occur and result in oscillations 
or the coexistence of multiple steady states. The analysis of the latter behaviour 
is particularly relevant in the context of the control of gene expression. 

The present chapter collects modelling papers from different areas of signal 
transduction. Meyer-Hermann et al. model the dynamics of the second messen- 
ger calcium. In neurons, calcium is involved in long-term modulation of synaptic 
transmission, and this problem is studied for various stimulation protocols. The 
contribution by Schoeberl et al. focuses on protein phosphorylation cascades. Pre- 
ceded by a discussion of general modelling aspects, a mathematical model of the 
EGF-induced MAP kinase pathways is introduced. The following three papers are 
devoted to intercellular signalling at the molecular, cellular, and developmental 
levels. Gosh et al. study the regulation of gap-junction channels that are respon- 
sible for the coordination of cellular activities in many tissues. Politi and Hofer 
model the propagation of hormone-evoked calcium waves observed in the liver. 
Plathe and 0yehaug consider spatial pattern formation in models of coupled cells. 
Finally, Dr. Volt’s contribution is devoted to questions of the design of biochemical 
systems, reviewing theoretical approaches to uncover their operating principles. 

^ Cohen, P. (2000) The regulation of protein function by multisite phosphorylation - a 25 year 
update. Trends Biochem. Sci. 25, 596-601. 
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1. Introduction 

A most fascinating challenge in neuron biology is a deep understanding of the 
mechanisms involved in long-term-effects (LTE) such as long-term-potentiation 
(LTP) and long-term-depression (LTD). The multiplicity of possibly important 
mechanisms is immense and is explored in a great variety of experiments [1]. 
However, the interpretations of those experiments are not as conclusive as they 
could be if it was possible to better compare experiments executed on different 
systems. We claim that a lot of detailed information on LTE is hidden in presently 
available experiments. 

One possible way to uncover this hidden knowledge is to construct a tool 
which is able to translate different experiments into a common language and then 
to compare them quantitatively. Such a tool is provided here for the analysis of 
the intracellular calcium concentration in presynaptic nerve terminals. It is very 
well established by experiments that LTE are mostly connected with dynamical 
changes in the calcium concentration [2]. Larger calcium concentrations induce a 
lot of mechanisms that may influence the transmission efficiency of synapses. This 
applies not only to the postsynaptic side of the synapse, where a calcium influx 
is believed to be necessary for the induction of LTP [3]. Intracellular calcium 
also induces exocytosis on the presynaptic side of the synapse and therefore is 
in part responsible for the release of neurotransmitter into the synaptic cleft [4]. 
It is believed that calcium/calmodulin dependent protein kinaises may trigger the 
amount of released neurotransmitter. This has been shown for example for the 
calcium/calmodulin kinase II in the squid giant synapse [5]. Therefore, we think 
that the intracellular calcium concentration is an appropriate observable to study 
the induction of effects as LTD and LTP from the very beginning. 

Already on this early level of LTE-induction a comparative quantitative eval- 
uation of different experiments may lead to new insights. Corresponding experi- 
ments have been done using different systems or using the same system under 
different conditions. In addition, a dependence of the presynaptic calcium dynam- 
ics on the corresponding target cell has been observed [6]. In order to compare 
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those experiments quantitatively one has to determine characteristics of the ex- 
periment that are important for the calcium dynamics and to introduce them into 
the model terminology. 

In the following we will develop a corresponding model which is based on 
previously developed models of the intracellular calcium dynamics. The number of 
models describing presynaptic calcium dynamics is comparably small [7-10]. The 
main focus of model work in this field lies on the postsynaptic calcium dynamics 
[7, 11-18] (for a review we refer to [19]). This is surely related to the fact that 
dendrites are better accessible in experiment than presynaptic boutons. Therefore, 
we use the knowledge which has been established on the postsynaptic side of 
the synapse in order to build a new model which can be used for comparison of 
presynaptic calcium dynamics in different experiments. Presynaptic measurements 
became more frequent due to technical developments in recent years [20]. 

The model has to be adjusted to specific experiments in a well-defined proce- 
dure, which is illustrated for the example of presynaptic nerve terminals in the rat 
neocortex [6]. The results found here are discussed in the context of LTE. For more 
details of the model and the results presented here we refer to [21]. This especially 
concerns a more detailed analysis of the model assumptions and the robustness of 
the results. 



2. The Model on the Level of Single Proteins 

We construct a new deterministic one-compartment model for the presynaptic cal- 
cium dynamics. The intracellular calcium concentration is changed in response to 
variations of the membrane potential by an ion-flux through voltage-gated calcium 
channels. The flux is driven by the electro-chemical gradient at the membrane. In- 
tracellular calcium is bound by an endogenous buffer. Only a small number of ions 
remains free and increases the concentration of free calcium. An increased calcium 
concentration is reduced by two types of transport proteins: ATP-driven calcium 
pumps and sodium-calcium exchangers. Their activity basically depends on the 
free calcium concentration. A leakage transmembrane calcium current is assumed, 
that compensates the activity of transport proteins and channels in the rest state 
of the cell. 

The model is formulated in terms of a set of ordinary coupled differential 
equations for the intracellular calcium concentration c: 

dc _ G Pi]Ju{9v{U),U{t),U{c))-ppJp{gp{c))-pEJE{gE{c))-\-L 

~dt ~ ^ 

where pu,p,E are the surface densities of the voltage-gated channels (HVA), the 
PMC A- type calcium pumps, and the sodium-calcium exchanger (type 1), respec- 
tively. 



‘7u,p,e = 5'u,p,e 7u,p,e 



( 2 ) 
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are the corresponding single protein currents. They are determined by the single 
protein open probabilities ^u,p,E 7 where depend on c according to standard 
Hill equations [22] for the pumps and the exchangers with Hill coefficient 2 [23] and 
1 [24], respectively. The open probability for the voltage gated calcium channels 
(HVA) obeys [25] 



dgx}{U) 

dt 



1 + exp 



U,/2 - u 




( 3 ) 



with r = 1ms the channel mean open time, Uxj^ — 3mP the half activation volt- 
age, and K = SmP the steepness factor. The values are taken from a representative 
HVA-channel [25]. /u is the voltage dependent current which is assumed to follow 
Ohms law with open channel conductivity of llyS. The electrochemical gradi- 
ent is calculated with respect to t/(c), the (calcium dependent) calcium reversal 
potential, which depends on c according to the Nernst equation. 

/p = 10“^^ A and /e = 40 10“^^ A are the maximum activities of the transport 
proteins. (7 is a geometry factor (ratio of surface and volume in the compartment), 
2 ; = 2 is the valence of the calcium ions, F the Faraday constant, and L the leak 
current which is determined by the steady state condition. 

is the stimulating transmembrane voltage function. We assume that the 
feedback effect of the calcium influx on the membrane potential is small with re- 
spect to standard action potentials. One may think of a voltage clamped situation. 
However, for large calcium concentrations as they may appear in high frequency 
stimulations, calcium buffers may saturate and the impact on the membrane po- 
tential may become non-negligible. Such effects are not considered in the present 
analysis. 

The buffers (endogenous and indicator) are treated in a quasi-steady state 
approximation, which claims that the calcium binds and dissociates faster than 
the typical time scale under consideration. Then the dynamical behavior of the 
buffers reduces to a correction factor in Eq. (1) which depends on the calcium 
concentration only: 



0,,(c) 



{K, + cf 



and ©i(c) 



{K^+cf 



( 4 ) 



Here 6„uix and z„iax are the total concentrations of the endogenous buffer and 
the indicator, respectively. are the corresponding dissociation constants. Note 
that it is important to include the indicator used in the experiment, because the 
indicator is basically acting as additional buffer and may drastically change the 
amount of free intracellular calcium. 

The differential equations are solved numerically and the solution describes 
the time-course of the calcium concentration resulting in response to single action 
potentials or to series of action potentials (as they are used to induce LTD or LTP). 
The above mentioned aspired generality of the model is reflected in a separation 
of the model parameters into three classes, described in the following subsections. 
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2.1. Universality 

The model is universal enough to be applicable to a wide class of different neuron 
types. To this end the model is based on the experimental knowledge about single 
proteins which are postulated to have neuron-type independent properties. These 
are the single protein characteristics px (including all derived physiological prop- 
erties) and the single protein transmembrane currents Jx^ where x stands for the 
corresponding type of protein (see e.g. [23-25]). 

2.2. Type Specificity 

The model is specific enough to be applicable to well defined neuron types. This is 
achieved by the introduction of measurable neuron-type specific parameters which 
has to be determined for each experiment separately. Basically, these are the pro- 
tein densities px in the membrane. As no space resolution of the calcium concen- 
tration is considered, these densities may be thought of as average values over 
the whole synaptic membrane. Also the concentration of the endogenous buffer 
^max and its dissociation constant Kt belong to the neuron-type specific class of 
parameters. Finally, the surface to volume ratio G of the synaptic compartment 
quantitatively determine the concentration changes due to transmembrane cur- 
rents. 

2.3. Condition Specificity 

The model includes enough general specifications in order to adjust the model to 
specific experimental conditions. The form and amplitude of the action potential 
U{t) is simulated with a system of coupled differential equations (not shown here) 
and can be adapted to the specific action potential used in experiment. The LTE- 
stimulation protocols used in experiment are simulated with a corresponding series 
of single action potentials. Intracellular calcium concentrations are generally vi- 
sualized with the help of calcium indicators. They act as an additional buffer in 
the cell and, therefore, may influence the calcium dynamics. In the model they are 
treated in complete analogy to the endogenous buffer and are characterized by the 
indicator specific dissociation constant Ki and the used indicator concentration 

^max • 



3. Adjustment to a Specific Experiment 

The idea of this semi-universal model for presynaptic calcium dynamics is to de- 
termine the universal parameters using single protein experimental data and to 
maintain the resulting values for the evaluation of different systems and experi- 
mental conditions. Universality has to be understood as the postulated statement, 
that universal parameters are not the parameters that are most sensitive to a 
transfer from one experiment to another. The adjustment of the model to a spe- 
cific experiment is achieved through the determination of the system-specific and 




Presynaptic Calcium Dynamics of Neurons 



67 



condition-specific parameters. Note that without any exception these are param- 
eters with direct physiological interpretation. Therefore, most of them may be 
accessible up to a sufficient precision in several experiments. 

In the described procedure the main part of the model is determined by 
sources that are independent of the experiment under consideration. Therefore, 
the value of the model is tested by its capability of reproducing the answer of the 
calcium concentration to single action potentials on the basis of those independent 
and fixed universal parameters. As in general not all specific parameters will have 
been determined in the experiment under consideration, we will fit the remaining 
unknown parameters to the single action potential calcium response. With the 
help of a thus defined model it should be possible to analyse the measured calcium 
transients evoked by LTD or LTP stimulation protocols. 

This has been executed for an experiment on single nerve terminals of pyrami- 
dal neurons in the neocortex of rats [6] : Most specific parameters used in the model 
are directly accessible in this experiment. This concerns for example the form of 
the action potential applied to the nerve terminal, the geometry (i.e. the surface to 
volume ratio G = 3/iJ.m), the concentration of endogenous buffer bumx = 120/iM, 
and the characteristics and concentrations of the used calcium indicator (magne- 
sium green). Therefore, we are in the situation that the whole model is a priori 
determined either by independent sources (concerning universal parameters) or 
by the available data from the experiment under consideration. The only un- 
known parameters are the surface densities of the calcium transport proteins. 
These are fitted to the measured calcium transient evoked by single action poten- 
tials (pu = 2.65/pm^, pp = 4000/pm'^, and pp = pp/30). The result, as Fig. 1 
shows, is that the megusured calcium transient is reproduced correctly. 



4. LTE Stimulation 

In order to check if the thus defined model has predictive power, we calculate the 
intracellular calcium transients evoked by series of action potentials with varying 
frequency. Basically, the model parameters remain unchanged. As in the corre- 
sponding experimental setup (see [6] Fig. 9) a different action potential (compared 
to Fig. 1) has been used, the action potential is adapted in the model and the 
channel densities are fitted to the single action potential calcium response, corre- 
spondingly (pu = 2.77//im^, pp = 3400/pm^, and pp = pp/30). In addition the 
indicator concentration is increaised, which is considerably higher in this experi- 
mental setup. Now the 10 Hz stimulus is applied and the result is shown in Fig. 2. 
The model result is in quantitative agreement with the calcium transients seen 
in the experiment: The intracellular calcium concentration reaches a new base- 
line level during the stimulation process which breaks down when the stimulus 
is switched off. The calcium concentration oscillates on top of the new baise line 
in coherence with the stimulation potential. The calcium reducing processes are 
slightly slow in the model. 
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Figure 1. The calcium transients evoked by single action poten- 
tials in single boutons of pyramidal neurons in the rat neocortex 
(100//M magnesium green). The dotted line shows the experimen- 
tal values [6] (we thank H.J. Koester and B. Sakmann for kind 
permission to reproduce the data), and the full line shows the 
model result with fitted protein densities. 



On this basis we can calculate the calcium transients in response to stimu- 
lation protocols with various frequencies. We find that calcium transients do not 
overlap for low frequencies typical for LTD-induction. The emergence of a new 
baseline in the calcium concentration at frequencies around 10 may be inter- 
preted as threshold for the induction of LTP. Note that this threshold frequency 
strongly depends on the used calcium indicator concentration. This is especially 
relevant for the interpretation of experimental results. A stimulation with frequen- 
cies around 50 //z (typical for the induction of LTP) leads to a more pronounced 
enhancement of the calcium baseline. This qualitative behavior is in agreement 
with experiments carried out on dendritic spines of pyramidal neurons [26]. 

5. Conclusion 

Our new model for transients of the presynaptic intracellular calcium concen- 
tration evoked by various stimulation protocols reproduces the general behavior 
observed in experiment. It is exclusively constructed with parameters that have 
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Figure 2. The calcium transients evoked by a lOHz stimulus in 
single boutons of pyramidal neurons in the rat neocortex (500^M 
magnesium green). The full line represents the model result and 
the dotted line the corresponding measured transients [6] (we 
thank H.J. Koester and B. Sakmann for kind permission to re- 
produce the data). The dashed line represents the best single ex- 
ponential fit to the experimental base line values [6]. 



a direct physiological interpretation. Its basis are the single proteins properties. 
The characteristics of single proteins are considered to be universal in the sense 
that they remain unchanged for different experimental setups. The model has been 
adjusted to a specific experiment that measured intracellular calcium transients in 
nerve terminals of pyramidal neurons of the rat neocortex. To this end the param- 
eters specific for this experiment have been extracted from it or, if not available, 
have been fitted to the single action potential calcium response. The model results 
turned out to be in quantitative agreement with the experiment. This applies not 
only to the presynaptic calcium concentration response to single action poten- 
tials but also to 10 Hz stimuli. We did not find any reason for the involvement of 
calcium-induced-calcium-release in the induction of LTE. However, it seems that 
an additional mechanism (e.g. calcium channel inactivation) may be necessary to 
understand the induction of LTD on the level of presynaptic calcium transients. 
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More generally, the separation of universal and specific parameters enables us 
to analyse different results observed in several experiments. With the help of the 
new model one may decide if those differences are significant or due to different 
experimental setups. 
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1. Introduction 

As biology begins to move into the “postgenomic area” the question arises of how 
complex biomolecular networks of living cells or bacteria function as dynamical sys- 
tems. Within the cytoplasm which is surrounded by a semipermeable membrane, 
the cell carries out thousands of chemical reactions and physical transformations 
which permit its own survival and reproduction. The cell must be able to respond 
to changes in the environment and to its own internal state, searching out raw 
materials and energy, avoiding toxins, repairing damaged parts and replicating. In 
order to meet all these demands of life the cell must be precisley regulated. All of 
these regulatory processes are carried out by the same molecular machinery that 
is being regulated: genes, proteins and metabolites [2], [33]. 

The proportion of identified components continues to increase and requires 
the development of useful models, including mathematical models as well as con- 
ceptual models, in order to understand these networks. Mathematical modeling 
is a proven tool for quantitative description of complicated systems and has been 
successfully applied in the engineering sciences. Mathematical modeling [29] can 
provide the necessary conceptual framework in order to: 

1. understand the responses of both normal and mutant organisms to stimuli; 

2. verify the consistency and completeness of reaction sets hypothesized to 
describe specific systems; 

3. integrate phenomena like crosstalk, positive or negative feedback within 
pathways or nonlinear interactions into a coherent picture of the operation 
of metabolic or regulatory networks; 

4. identify design principles for the biochemical architecture of biological 
systems; 

Prominent examples for these complicated biological networks are intracellu- 
lar signaling cascades, metabolic pathways and cell-cell communication. Because 
signal flow is tightly regulated with positive and negative feedback control and 
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is bidirectional with commands traveling from outside-in and inside-out, dynamic 
models that couple biophysical and biochemical elements are required to consider 
information processing during both transient and steady state conditions. 

Comparing cells with chemical plants, the approach of a modular modeling 
concept for biological systems seems to be a useful tool. In non-biological systems 
engineering this is generally carried out in a hierarchical fashion. Thus a modu- 
lar framework, treating subsystems of complex molecular biological networks as 
functional units that perform identifiable tasks might be applied to biological sys- 
tems. These functional units can then be represented [16], as composite modeling 
objects that consist of interconnected smaller modeling objects. The bottommost 
level of this hierarchy comprises basic entities representing single substances and 
reactions. Likewise, it is possible to model large systems in a transparent manner 
and thereby provide the complete information content of the system. 

Cytokines are local messenger molecules that transmit information between 
cells and have a major impact on growth regulation, cell division and apoptosis. 
Currently, more than 200 different cytokines have been identified, e. g. interleukins, 
growth factors and chemokines. The key event that initiates intracellular signaling 
pathways, is the binding of the cytokine to the receptor and receptor oligomeriza- 
tion. The autophosphorylated receptor triggers the binding of adapter molecules 
which then activates signaling cascades leading to gene expression. 
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Figure 1. Illustration of possible dynamic responses of signal trans- 
duction systems 



Each cytokine triggers not only a unique signal for a cellular response, but 
multiple and diverse biological functions, though the large number of signaling 
pathways that utilize similar molecular components. From the analysis of the dif- 
ferent pathways we know that they interact with each other at different signaling 
compounds and different levels in the signaling cascade. Thus the response to 
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a cytokine depends on the degree of interaction with other signaling pathways. 
Examining the proteins serving as signaling compounds it appears that their pri- 
mary function is the transfer and processing of information rather than the chem- 
ical transformation of metabolic intermediates. With the help of these proteins 
information is transferred from the cell membrane to the genome. In the signal 
transduction network they are linked into circuits that amplify and integrate in- 
formation signals or serve as information storages [2]. 

Until now our understanding of the system in a holistic manner has been 
very limited. The use of mathematical modeling to describe biochemical events 
triggered at the cell surface has been conisderably delayed due to the only recent 
characterization of receptor structures. Due to the significant advances that have 
been made in the field of molecular cell biology in the past decade, it is now 
possible to quantify the concentration of signaling compounds, to isolate certain 
compounds, to create knock-outs or study the translocation of signaling molecules 
with the help of fluorescence microscopy. Thus the bridge between qualitative 
and quantitative biology can be made. Due to the complexity of the signaling 
networks an interdisciplinary approach in order to understand the system as a 
whole is absolutely necessary. 

Two of the most common responses of living cells towards external stimuli 
are amplification and adaptation. Amplification enables a cell to transform small- 
est changes in stimuli into substantial biochemical changes. Adaptation enables 
the cell to measure relative rather than absolute changes and to counterbalance 
the system. For example adaptation can be achieved by reduction of cell surface 
receptors due to internalization and degradation or negative feedback mechanisms. 

Generally speaking, as shown in Figure 1, an impulse-like or a sustained input 
signal triggers a certain output signal leading to transcription or translation. The 
output signal might be regulated by positive or negative feedback mechanisms 
resulting in different output patterns. A sustained output signal is attained in the 
absence of adaptation. In contrast perfect adaptation results in a transient signal 
which reestablishes its initial level. Furthermore, oscillations might be observed as 
possible output signal [14]. 

In the past only parts of signal transduction pathways have been modeled 
mathematically as receptor-ligand interaction [18, 4], receptor internalization [30], 
the MAPK cascade [12, 5] or the caspase cascade [7]. From these models one could 
derive a new functional understanding of certain mechanisms as the significant 
signal amplification by the MAP kinase cascade and or the negative regulation of 
signals by receptor trafficking. 

The following will describe some ideas on how models of even larger systems 
can be derived and introduce some examples of possible applications of these 
detailed biochemical reaction networks. 
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2. Modeling Principles 

2.1. Introduction 

Figure 2 shows the general process that is undertaken in model development in 
order to study biological phenomena. First biologists and modelers have to coop- 
erate to determine which questions should be solved by the model. Based on this 
knowledge, appropriate analysis methods for the modelling have to be selected. At 
the beginning of the model building process there are many decisions that have to 
be made, either explicitly or often implicitly, e.g. whether to generate a specific 
or rather general model, whether the model should be deterministic or stochastic, 
whether the problem should be treated in a continuous or discrete way or whether 
the model should be qualitative or quantitative. From these “meta - modeling” 
questions it is obvious that there is no single definition for the “best model”. A 
general rule is to keep it as simple as possible to begin with. 

In the case of mathematical modeling of metabolic networks Ordinary Differ- 
ential Equations (ODEs) seem to be adequate to simulate these systems dynami- 
cally. As signal transduction networks are very similar in comparison to metabolic 
networks, ODEs seem as a suitable starting point as long as there is no need for 
reaction-diffusion approaches, which would then require the use of Partial Differ- 
ential Equations (PDFs). As the concentration of most of the signaling molecules 
seems to be rather high (> InM) in cells, a stochastic approach seems not to be 
necessary for most systems. 




Figure 2. Schematic representation of the major steps in model 
development of biological models 

When the modeling method has been selected, the model structure has to be 
defined, which means in this case the definition of the biochemical reactions. When 
an operating model has been developed, it has to be made sure that the model 
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passes several tests before it is applied to a real-world problem as shown in Fig. 2. 
One method is to test the model against data for which we have expected answers. 
This insures that the model indeed performs as expected. Another test would be a 
negative test, thus the model should not perform correctly with false input data. 
This methodology ascertains that the model performs well within its defined do- 
main. Finally the question of stability and sensitivity has to be addressed. Stability 
can be thought of as robustness of the system. How robust a system is, defines 
whether or not the system settles to its solution space when excited over a broad 
range of input conditions. In contrast, sensitivity analysis is a measure of how 
much error is introduced by parametric variation. 

Thus, it might take several iterative rounds between model structure defini- 
tion and model verification until an operational model is achieved. 



2.2. Available Modeling Tools 

Starting with a verbal or graphical model of possible interactions this has to be 
implemented into a mathematical model. During recent years the explosion of 
interest in systems-level modeling and analysis of biochemical networks has been 
accompanied by the expansion in the diversity of software tools. In the following 
we will give a short overview about existing packages. 

The Virtual Cell (www.nrcam.uchc.edu) by J. Schaff and L. Loew, offers 
the possibility to model biological reaction diffusion problems, e.g. propagating 
calcium waves [28]. Furthermore, there are a number of simulation packages avail- 
able as the E-CELL environment developed by M. Tomita et al. [32], Gepasi, 
which is probably the oldest existing simulator that was started in the 1980’s 
by P. Mendes [21], PathwayPrism, Physiome Sciences, Inc. a commercial simu- 
lation tool, Jarnac, developed by H. Sauro at Caltech, DBSolve by I. Goryanin 
[9] as well as Promot/DIVA, developed at the Max Planck Institute of Dynam- 
ics of Complex Technical Systems in Magdeburg [8], which provides a number 
of unique numerical analysis tools for sensitivity analysis or bifurcation analysis. 
StochSim written by Carl Firth et al. at the University of Cambridge [27] is a 
stochastic simulator for biochemical reactions. Here we have listed only a fraction 
of the existing packages. More detailed information for the packages described 
above can be found on the web e.g. http://www.cds.caltech.edu/erato/links.htrnl 
or http://bioinformatics.org. In the following we present Promot in more detail 
taking the example of the EGF induced MAP kinase cascade. All software packages 
intend to be tools for experimentalists as well as theorists to test their hypotheses 
and are available as shareware. Most of them are made to build models based on 
ODEs except the Virtual Cell which focuses on reaction-diffusion problems that 
are described with PDEs and Stochsim which applies a stochastic approach. In or- 
der to enable the exchange of models generated with the help of different modeling 
packages, the ERATO Systems Biology Workbench has been created [13]. 
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2.3. Model Generation 

As in the near future no modeling and simulation package can provide tools for ev- 
ery analysis task, the SBW can provide one possibility of combining the strengths 
of different tools and to exchange models with the XML-based modeling language 
SBML. Independent of which software package is used, the process of model gen- 
eration asks for several steps: 

• Search for an interesting biological system 

• Development of a detailed biochemical reaction network with the help 
of literature and www sites like Biocarta (www.biocarta.com). Transpath 
(www.transpath.gbf.de) etc. 

• Search for quantitative data and published rate constants 

• Parameter sensitivity analysis in order to determine parameters that can 
be estimated 

• Parameter estimation 

• Model validation, test of known hypotheses 

• Creation of new hypotheses, model prediction 

• Experimental validation of model dericed hypotheses, predictions 

• Design of virtual experiments 

In the following we will explain the above steps in further detail. Once a sys- 
tem has been defined, there are different possibilities for representing the biochem- 
ical reaction network graphically. Some of the simulation tools described earlier 
allow a graphical model generation. In principle there are two approaches, which 
are currently utilized: (i) based on traditional biochemical representations [15], or 
(ii) based on traditional engineering approaches as flow sheets Promot 
[8] or bond graphs Pansym [31]. 

On the most basic level both Promot and Pansym use so-called storage 
elements which contain the system’s dynamic state variables and represent the 
different species and signaling compounds. The interactions between the storage 
elements, like biochemical reactions, can be found in so-called transformer ele- 
ments. These transformers act as connections between the storages and provide 
reaction rates whereas the storages calculate balance equations for the species. 

Starting from basic interactions more complex modular subsystems can be 
created. In order to keep the model simple and transparent the representation 
according to the Modular Modeling Concept [16] uses different levels of complex- 
ity. On the lowest level the elementary reactions are represented and on higher 
levels we find so-called functional units which represent a certain module of the 
signal transduction pathway like the MAP kinase cascade as shown in Fig. 4. 
These functional units appear on each level of complexity and more complex units 
result from the aggregation of smaller less complex units. Regarding the mathe- 
matical modeling of cellular systems, this modular structure raises the possibility 
of independently developing mathematical models for each of the functional units. 
These submodels, which correspond to functional units in the biological system. 
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can then be connected to larger models, describing the particular biological sys- 
tem of interest. It also offers the possibility of providing implementations of the 
functional unit in different detail resolution that meets the requirements of the 
current modeling task. Possible criteria for structuring complex biological systems 
are: (i) a common physical goal, (ii) a single genetifc unit or (iii) a common signal 
transduction system [19]. 

Independent of which software package is used to create the model the basic 
idea is to translate a biochemical reaction network consisting of n species Xi and 
V biochemical reactions into a system of ODEs. 

0^1 J • + • • • + an,j ' Xn -4 Pi j • Xi H- • • • + f3nj • Xn (1) 

Where k^ is the kinetic constant of the reaction, aij and f3ij are the sto- 
ichiometric coefficients for substance X^’s involvement as educt or as product, 
respectively. Applying mass action kinetics, the mathematical equations (ODEs) 
are retrieved accordingly based on the biochemical reaction schemes represented 
in the appendix. As proposed in [26], the differential equations of the different 
compounds Ci {i = 1 ... n) can be written in a canonical form as shown by the 
following equation: 

n^r-' (2) 

j=i les, 

Here, Sj represents the set of species actually participating in reaction j as 
educts, i.e. aij > V / G Sj. 

The mathematical model is a system of ODEs S(c,u,p,t) which depends on 
the state variables c (corresponding to the concentration vector c, which was in- 
troduced earlier), the input parameters u (like number of receptors, ligand con- 
centration etc.), the model parameters p and time t. Thus S can be written as: 

S(c, u, p, t) = c - f(c, u, p, t) = 0 (3) 

and t > to, the functional f and the initial conditions c(to)=Co- 

Once the model is composed and a training set of experimental data either 
from literature or own experimental data is defined, the unknown kinetic param- 
eters have to be estimated. In order to define those kinetic parameters which 
can be estimated with the help of the available experimental data a sensitivity 
analysis should be carried out. Biochemical reaction networks show a certain ro- 
bustness concerning parameter variations or the variation of initial conditions [35]. 
This seems to be characteristic for biochemical reaction networks. Only some pa- 
rameters show a certain sensitivity towards parameter variations. The sensitivity 
analysis determines the sensitivity of the outcomes of a system to changes in its 
parameters (as opposed to changes in the environment). If a small change in a 
parameter results in relatively large changes in the outcomes, the outcomes are 
said to be sensitive to that parameter. This may mean that the parameter has to 
be determined very accurately, or that this parameter may be a useful target for 
changing the systems behavior. 
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The state sensitivities, which are the partial derivatives of x with respect to 
p - are derived from the variation equation 
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whereby the Jacobians Jc for the states and Jp for the parameters are given by 
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It has to be emphasized that the sensitivities are time dependent and local 
with respect to parameter space. 

At this point a set of parameters has been determined, which can describe the 
experimental data. Now the quality of the model has to be tested with experimental 
findings which have not explicitly been included into the model, like overexpression 
or deletion of a certain protein. If the simulation results correspond well with the 
experimental findings the model will be adequate for making predictions which 
have to be tested afterwards experimentally. In the following we will present some 
examples based on the EGF induced MAP kinase cascade and explain how such 
mathematical models can help to plan experiments, lead to new hypotheses and 
in general lead to a better understanding of the mechanism and control of signal 
transduction pathways. 



3. Predictive Potential and Application of Mathematical Models 

In this section we present the modeling representation of the EGF induced MAP 
kinase cascade in Promot. Furthermore, we will show simulation results of the 
mathematical model as well as its predictive potential and the ability to gain new 
mechanistic insights with the help of such models. 

The EGF induced MAP Kinase cascade may be divided into several parts as 
Fig. 3 shows: 1. EGF binding, receptor dimerization and autophosphorylation, 2. 
signaling complex formation at the cytoplasmic tail of the receptor (association of 
GAP, recruitment of Shc*-Grb2-Sos and Grb2-Sos), 3. and 4. Ras activation by 
two separate pathways leading each to the stimulation of the MAP kinase cascade. 
5. receptor internalization, recycling, degradation and de novo synthesis. The main 
characteristics such as the receptor activation, the two independent pathways of 
receptor complex formation and the MAP kinase cascade can be represented due to 
the modularity in a very transparent fashion as depicted in Fig. 4, which represents 
the EGF induced MAP kinase cascade according to the Modular Modeling Concept 
in Promot. 

Based on literature data, a first model was developed, which allowed predic- 
tions about the EGF concentration dependence of ERK activation. This rather 
qualitative model predicted that the ERK activation is maximal over a broad 
range of EGF concentrations, from 50 ng/ml to 0.5 ng/ml, which has never been 
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Figure 3. Biological Scheme of the EGF induced MAP kiiictse cascade 



shown before experimentally. Due to these unexpected results, we carried out ex- 
periments, which were in good accordance with the simulation results as depicted 
in Fig. 5. 

The time-course of EGF receptor phosphorylation, the increase in total phos- 
phorylated She molecules, activation of Ras-GTP as well as the phosphorylation 
of Raf, MEK and ERK is resolved for activation by different EGF concentrations. 
Autophosphorylation of dimerized EGF receptor (EGFR2*) is known to peak af- 
ter 15 s for concentrations between 10 and 120 ng/ml EGF [22] which is in good 
agreement with the simulation results shown. At 50 ng/ml EGF, the model predicts 
complete phosphorylation of EGF receptors of the cell (50 000) within 15 s. The 
decline in the total number of phosphorylated EGFR2* calculated for each con- 
centration as represented in Fig. 5 A is in accordance with literature data [10, 36]. 
After reaching the maximum at 0.08 nM (0.5 ng/ml) EGF, only 50 % receptors 
are phosphorylated, and even less at lower EGF concentrations. The time-course 
of EGF receptor autophosphorylation, the increase in total phosphorylated She 
molecules, activation of Ras-GTP as well as of the phosphorylation of Raf, MEK 
and ERK is resolved for activation by different EGF concentrations. Autophos- 
phorylation of dimerized EGF receptor (EGFR2*) is known to peak after 15 s for 
concentrations between 10 and 120 ng/ml EGF [22] which is consistent with the 
simulation results shown. 
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Figure 4. Representation of the EGF induced MAP Kinase Cas- 
cade according to the Modular Modeling Concept in Promot 

In contrast to She phosphorylation, the activation of Ras-GTP showed a 
clear concentration dependent signal, with increasing amplitude and earlier peaks 
at higher concentration (Fig. 5C). The maximum number of Ras-GTP is predicted 
at approximately 1 min after EGF stimulation, followed by a rapid recovery to the 
inactive GDP bound state by 5 minutes [36]. According to our model the ratio 
of the maximal number of Ras-GTP to the total number of Ras-GDP is about 
0.25, which is consistent with experimental findings. [3] A similar signal pattern 
for Ras-GTP occurs as for Raf kinase activation [34] MEK and ERK phospho- 
rylation, again with the corresponding peak maxima, accelerated with increasing 
relative amplitudes. Interestingly, the amplification steps from Raf to MEK (Fig. 5 
D and E) are similar for all EGF concentrations (about 1: 10); however, from MEK 
to ERK (Fig. 5 E and F) a tremendous amplification step of almost 40- fold at 50 
ng/ml EGF and of about 80 -fold at 0.02 nM (0.125 ng/ml EGF) is observed. 
Although the inititial signal of receptor autophosphorylation is below 10 % of 
maximum at this low EGF concentration (Fig. 5A), the model predicted that 
ERK activation still reaches about 70 % of the maximum amplitude. Thus, in 
this last step of the MAP kinase cascade, a switch like activation of ERK should 
occur. This is in accordance with a steady state model of Huang and Ferrel [6]. 
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Figure 5. Concentration dependence of EGF induced MAP Ki- 
nase Cascade. Simulation and experimental results: black dot 8 
nM (50 ng/ml), gray square 0.08 nM (0.5 ng/ml), black triangle 
0.02 nM (0.125 ng/ml) EGF 



In our model, the EGF induced MAP kinase cascade was calculated by dynamic 
and concentration dependent resolution. The pattern of ERK phosphorylation is 
similar over two orders of magnitude of ligand concentration resulting in the same 
amplitude and similar duration. Interestingly, the kinetics of the peak maximum 
correlates rather with the temporal pattern of phosphorylated She bound to the 
receptors than the initial kinetics of EGFR2* phosphorylation. The computational 
signal patterns for ERK activation were experimentally confirmed by kinetic and 
dose response of ERK phosphorylation in HeLa cells. The normalized data calcu- 
lated from the densitometric evaluation of the bands were implemented in Fig. 5F 
and are represented by symbols. In the concentration range of 8 nM (50 ng/ml) to 
0.08 nM (0.5 ng/ml) EGF induced ERK-1/2 phosphorylation showed maximum 
signal with delayed peak maxima between 1 and 10 min, but identical signal dura- 
tion (Fig. 5F). At saturating EGF concentrations, ERK-1/2 activation is known to 
peak after 1 to 5 min, depending on the cell type and to decline thereafter. [11, 1]. 
Of note, only at a concentration of 0.02 nM (0.125 ng/ml) EGF, was a significant 
reduction in peak height noticed, and a delay of 15 min. This concentration is just 
below the KD of the receptor. 
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Interestingly, the comparison of the time-courses for the EGFR2* and ERK 
activation at 0.08 nM (0.5 ng/ml) indicates that the maximum activated EGFR2* 
is not important for the ERK response as shown in Fig. 5 A and F. The maximum 
ERK phosphorylation appears 10 min before the maximum receptor phosphoryla- 
tion is attained. Thus, it seems that the initial velocity of receptor phosphorylation 
determines the signal output. 

Furthermore, the simulation results for a mutant overexpressing EGF recep- 
tors showed a sustained activation of ERK [24] , which has also been observed ex- 
perimentally in tumor cells expressing high numbers of EGF receptor [23] . Whether 
the ERK signal is sustained or transient it is believed to trigger either proliferation 
or diffentiation [20]. 

Besides the mutant overexpressing EGF receptors, the model predicts suc- 
cessfully the deletion of the adapter molecule She. For high EGF concentrations 
(~nM) there is no difference in the ERK activation observable, which corresponds 
well with the experimental observations. In contrast the She pathway serves as 
an amplifier of the ERK activation at low EGF concentrations (~ O.OlnM) [17]. 
These experimental findings, which have not been taken into account during the 
model development, and are consistent with the simulation results, validate the 
quality of the model. 



4. Conclusions and Outlook 

In this chapter, we give a short overview of different approaches, which can be 
applied to build mathematical models of biological systems. However, we focus on 
mathematical models of signal transduction pathways. Next to general principles 
of the modeling process we present currently available software packages. 

Based on the example of the EGF induced MAP kinase cascade, we show 
that it is possible to create mathematical models of signal transduction pathways 
with a certain predictive potential. The mathematical model of the EGF induced 
MAP kinase cascade is able to predict the ERK response as a function of EGF 
concentration, EGF receptor number or even for dominant negative She mutants. 

Likewise, mathematical models can help to plan experiments for example to 
define the time points when measurements should be taken in order to observe the 
most interesting phenomena. Furthermore these models can help to test hypothe- 
ses and create new hypotheses as shown above or in the example of a model of 
TNF receptor crosstalk [25]. Particularly in respect to pathway crosstalk, mathe- 
matical models in combination with quantitative biochemical data will lead to a 
mechanistic understanding of the regulation of often opposite cellular responses 
like differentiation / proliferation and apoptosis (programmed cell death). 

In the future these kinds of models could to be applied to drug target iden- 
tification and drug development. Virtual inhibitors can be included into these 
mathematical models and likewise lead to the identification of target proteins and 
give an idea of the biochemical properties of the optimal inhibitor. Thus, in the 
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future due to a better understanding of generally identifiable mechanisms realized 
in signal transduction pathways, the engineering of signal transduction pathways 
will be made possible, which then leads in turn to a more specified treatment of 
diseases. 

The combination of mathematical modeling and biological experiments opens 
a new system level understanding of biological systems, which then in turn allows 
a specific engineering of these systems. 
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1. Introduction 

Voltage dependence of gating of ion channels has taken the center-stage in mem- 
brane biology. It is known that electric fields exist or arise in living cells and tissues 
affecting conductance of ion channels. Several experiments have pointed out that 
channels in a membrane behave in a cooperative manner [1,2]. This cooperativ- 
ity depends on both internal parameters, e.g. structural parameters, and external 
factors, e.g. membrane potential. In our previous papers we had suggested that 
mutual interactions among the protein channels on a cell membrane matrix is a 
viable mechanism of controlling ion channel conductance [3,4]. A statistical me- 
chanical model was proposed to deal with the channel-channel interactions. One 
useful measure of channel-channel interaction is the number of ion-channels open- 
ing under voltage-clamped conditions. So far, no quantitative relation between 
the number of channels opening collectively and the externally applied potential 
across the membrane has been established. In this paper, we propose an empirical 
quantitative relation between the said quantities based on bilayer electrophysio- 
logical experiments. We further propose possible mathematical relations for the 
voltage dependence of opening probabilities of channels, behaving collectively or 
singly, on the basis of the aforesaid model. This investigation was carried out on 
multi-channel ensembles of gap junctions, a representative of the class of passive 
diffusion channels, on lipid bilayer membranes (BLMs). It may be mentioned here 
that gap junctions facilitate cell-to-cell communication in eukaryotes by passive 
diffusion of ions through low resistance passages [5,6]. These are plasma mem- 
brane specialized proteins forming transmembrane channels in both excitable and 
non-excitable cells [7-13]. The latter proved to be very important in embryonic 
development and differentiation [14,15]. Recently it has been reported by several 
workers that gap junctions behave as electrical synapses and thus play an impor- 
tant role in neuronal communication [16,17]. In the present work, we look for the 
physiological consequences of the collective opening of gap junctions, in addition 
to a quantitative relation between the number of channels open and the externally 
applied potential across the membrane. 
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2. Materials and Methods 

Chemicals: Lipids were bought from Avanti Polar Lipids (Alabama, USA). Tris, 
HEPES, Potassium Chloride, Magnesium Chloride and all other chemicals were 
obtained from Sigma Chemical Co. (St. Louis, MO, USA). 

Isolation of Gap Junction: Gap junction protein was isolated from Wistar 
rat liver plasma membrane following alkali digestion method [18]. Concentration 
of the protein was checked by the modified method of Lowry [19]; and homogeneity 
of the protein was checked in 10% SDS PAGE [20]. 

Bilayer Experiments: Bilayer membranes (BLMs) were formed across a 150 
fim diameter aperture in a Delrin (R) cup (Warner instruments, USA) with Di- 
phytanoyl Phosphatidyl Choline and Cholesterol (6:l::w:w) solution in n-decane 
(21 mg/ml). Channels were incorporated into the membrane by fusing the prote- 
oliposomes with it in a symmetric buffer solution (10 mM HEPES, 500 mM KCl, 
5 mM MgCl'i, pH 7.4). Single channel currents were recorded for different hold- 
ing potentials with an Axopatch 200 A integrating patch clamp amplifier (Axon 
instruments, USA) and stored in unfiltered form (5 kHz bandwidth) on video cas- 
settes after digitization through an analog to digital converter (VR lOB, Instrutech 
Corp., USA). The recordings were analyzed using Axodata and Axograph software 
(Axon Instrument, Foster City, CA) in an Apple Macintosh computer after filter- 
ing through 8 pole Bessel filter (902 LPF, Frequency Devices, Haverhill, MA) as 
has been mentioned in our earlier reports [21-24]. After establishing the single 
channel currents of gap junction for different holding potentials in our experimen- 
tal conditions, multi-channel recordings were performed and analyzed in the same 
way. 



Determination of Number of Reconstituted Gap Jimction Channels (n) in 
BLM: The most probable number of channels open (n) at a particular holding 
potential is determined from the ratio of the current {Iprob) flowing through the 
most probable multi-channel at a particular holding potential, to the single channel 
current (Is^nf^/f ) at the same potential. 



^prob 

^single 



= n 



( 1 ) 



The most probable multi-channel at a given holding potential is determined 
from the maximum opening time count, as obtained from amplitude histograms 
of channel recordings. Isingic refers to the averaged channel current measured in 
single channel preparations. In a multi-channel preparation, the current levels 
at a number of clamping potentials, corresponding to the smallest open clusters 
observed, were actually found to be integral multiples of Isingie^ This observation 
then leads us to assume that the contribution of a single channel current to the 
total current does not vary significantly from cluster to cluster. The maximum 
possible error in the value of n is estimated from the sum of the possible errors 
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in the calculations of Iprob and Isingie^ The error in Iprob is estimated from the 
half-width of the highest peak in the amplitude histogram, while that in Isingie is 
estimated from the standard deviation in the ensemble- averaged value. 



3. Results and Discussions 

The representative current traces of gap junctions incorporated in lipid bilayer 
at four different voltages, shown in Fig. 1, indicate the existence of open states 
in clusters. Interestingly, there are small as well as large opening of channels as 
observed in the traces. This indicates that instead of opening (or closing) one by 
one in a sequence, a number of channels open (or close) together in clusters of 
varying size at random. In other words they behave collectively. A plot of the sin- 
gle channel current calculated as per the method described above, versus applied 
holding potential (I-V plot) is shown in Fig. 2. The latter is used in estimating the 
most probable number of channels participating in a multi-channel opening (n) 
at a particular potential (V) applied across the BLM as defined in Eq. (1). These 
have been enumerated for several membrane preparations. One set of representa- 
tive plots is shown in Fig. 3 (a) and (b). It may be noted that the results have not 
been averaged for different membrane preparations, because the total number of 
channels incorporated in lipid bilayer changes with membrane preparation leading 
to a significant difference in the number of channels open. This is possibly due 
to the fact that the extent of collective opening of channels depends on the total 
number of channels incorporated. Fig. 3 shows that n varies with the membrane 
potential in a nonlinear fashion. Within the experimental range of potential, n 
initially increases with increasing magnitude of holding potential, reaches a max- 
imum and then ‘‘oscillates” around the maximum with further increcise of the 
same. It is minimum near the zero of holding potential. The number of channels 
opening is seen to reach a maximum at a membrane potential of around 70 mV. 
As a whole, the entire n-V profile is sigmoidal in nature for both positive and 
negative potentials. The pattern has been qualitatively observed to be consistent 
in a number of experimental preparations (20 different sets of observations) with 
different number of gap junction channels incorporated in the BLM. 

A suitable quantitative expression that fits the above mentioned experimental 
data is given by the following modulated tan-hyperbolic function 

V V -Vn n 

n = ao{l ecos — )tanh { — — — ) (2) 

m Vs 

where ao, e{< 1), m, Vq and Vg are constant parameters of the system under a 
particular experimental condition. The best-fit curves for n-V plots as obtained 
by suitably choosing these parameters, are also shown in Fig. 3 (a) and (b). The 
equation together with the plot show that the response of the system (expressed 
through n) is asymmetric around the zero of applied voltage. The parameters in 
Eq. (2) can be interpreted as follows: uq is the maximum number of channels 
opening and is of the order of the number of channels incorporated; VJ), around 




92 



P. Ghosh, D. Bose and S. Ghosh 



"20 mV 

50 pA 



1 sec 





-60 mV 




Figure 1. Representative multi-channel current traces of gap 
junction protein in planar BLM composed of Diphytanoyl Phos- 
phatidyl Choline/n-Decane with symmetrical solutions of 500 mM 
KCl, 10 mMHEPES, 5 mM MgCl 2 , pH 7.4. BLMs were clamped 
at different holding potentials with respect to ground as indicated 
on the left-hand side of the figure. Data were filtered at 1 kHz and 
sampled at 2 kHz. 



zero, is the cutoff potential below which the channels would not preferably open; 
e and m indicate the degree of modulation; and Vg is a scaling potential. We had 
proposed a model on the collective interaction of ion channels based on statistical 
mechanics [3,4]. The expression for the fraction of channels open {9) as derived on 
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Figure 2. The steady-state single channel current versus poten- 
tial plot (I-V) for gap junction channel from rat liver in lipid 
BLM. The experimental conditions and specifications are similar 
to those in Fig. 1. Each current value is averaged over 5 indepen- 
dent experiments. 



the basis of the above model is given by the following expression 

(l-s) 



9 






^(1 - s)‘^ -h 4cr‘^ 



) 



(3) 



where a and s are measures of the probabilities of single and collective channel 
opening respectively, both being functions of membrane potential and A is given 
by the expression 

A=i{(l + s) + \/(l-s)2 + 4<7'^} (4) 

The most probable number of channels open (n) is then equal to 0 times the 
total number of channels incorporated. Comparison of Eqs. 2 and 3 indicate that 
a and s have potential dependence qualitatively similar to that of n. It may be 
mentioned here that such a non-linear pattern is indicative of collective behavior 
as observed in a number of physical and biological systems [25-27]. The forces 
involved in collective gating of the channels are both cooperative and inhibitory. 
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Figure 3. (a) and (b): Representative plots showing the vari- 
ation of the most probable number of channels participating in 
a multi-channel opening (n) of rat liver gap junction as a func- 
tion of holding potential (V). Experimental conditions are similar 
to those of Fig. 1. Scattered points indicate experimental values. 
Best-fit curves (bold lines) are drawn on the basis of the Eqn. 
(2). The simulated values of parameters (uo, 6, m, Vo and Vs ) are 
indicated in the inset. 



It has been argued that opening of one channel would lead to a change in ion 
flux, which might affect the gating of neighbouring channels [3]. On the other 
hand, flow of cations and anions in different sides of the channel would create 
diffusion potential against the externally applied potential causing channel clo- 
sure. Moreover, phosphorylation of gap-junctions by extrinsic protein kinases A, 
C, or by effectors of protein kinases, cAMP, TPA etc. [28-30], or interaction with 
some metabolites, e.g. Ca"*"”*", cAMP above critical concentrations are found to in- 
hibit the channels. Opening of some gap junctions for sometime is likely to cause 
passage of the aforesaid metabolites leading to closure of neighboring channels. 
The observed regulation of gap junction multi-channel current through collective 
behavior has very significant biological consequences. In a cell membrane, the 
channels remain in numbers. When there is a weak external electric field (V close 
to zero), the multi-channel current is low due to less number of channels open, 
indicating lesser degree of collectivity. As the magnitude of external electric field 
increases, larger number of channels open at a time to give higher channel cur- 
rent. However, the increase is nonlinear. As a result of mutual interaction and 
collectivization, the channels begin to inhibit opening beyond some critical field. 
In other words, even if there exists a strong external field the cells would avoid 
opening up all the ion-channels, thus protecting leakage of necessary metabolites 
and electrical efflux. For example, in a developing embryo the cell-to-cell com- 
munication depends on the flux of ions and morphogens across the membrane. A 
sudden and large disturbance in this flux due to an external electric field would 
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lead to communication errors among cells, hence, developmental defects. Here, we 
hypothesize that the phenomenon of channel collectivity holds for other passive 
diffusion channels, if not for all the ion-channels. Hence, the sigmoidal pattern 
of n-V plot is likely to exist in a natural membrane too. If so, then this could 
be an effective mechanism of the cellular protection from external electrical or 
electrochemical disturbances. It may be mentioned here that neuronal cells, e.g. 
neurons, have gap junctions which form electrical synapses and are involved in 
neuronal communication and electrical signaling, a process essential for cognition, 
recognition and memory [31-35]. The nervous system dominantly works through 
exchanging messages and nonlinear computation using the rate of production of 
pulses of activity by a nerve cell as the signal or information being sent to another 
cell. It is believed that the aforesaid efflux rate is controlled via the change in the 
number of ion-channels open. Looking into our observation of sigmoidal variation 
of n with P, we find that the phenomenon could be one of the mechanisms by 
which neuronal communication is controlled and maintained. To conclude, in this 
work we have established the expression for the voltage dependence of the number 
of gap-junction channels opening collectively as well as the opening probabilities 
of the channels. This has been done on the basis of a statistical mechanical model 
combined with empirical data on bilayer electrophysiology. 
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1. Introduction 

Changes in the intracellular concentration of the second messenger Ca^^ represent 
one of the first cellular responses to a multitude of extracellular stimuli. Typically, 
agonist-receptor-mediated activation of phospholipase C (PLC) generates inosi- 
tol 1,4,5-trisphosphate (InsP^), which induces, through InsPa-sensitive channels 
(InsP.i receptors, InsP^R), Ca^^ release from internal stores into the cytoplasm. 
The further activation of InsP.iR by cytosplcismic Ca^“*" causes calcium-induced 
calcium release (CICR). The temporal and spatial characteristics of Ca*^"^ signals 
exhibit a high plasticity, depending on the dose and type of stimulation [32]. In 
hepatocytes and many other cell types, one observes sustained oscillations in Ca^“*" 
concentration, the frequency of which increases with the agonist dose. These os- 
cillations originate at a particular cytoplasmic location and spread as periodic 
concentration waves across the cell. There is now increasing evidence that Ca^“^ 
signals are not limited to single cells but can propagate to neighboring cells in the 
form of intercellular Ca^^ waves. Intercellular Ca^“*~ waves have been observed, 
e.g., in airway epithelial cells [29], glial cells of the nervous system [5], hepatocytes 
[33], the intact liver [28], articular chondrocytes [7] and the blowfly salivary gland 
[39]. Intercellular communication can take place through gap junction channels 
(see [13]) and/or via an extracellular route [16, 30, 15]. 

In this study, we focus on the intercellular coordination of calcium oscillations 
in systems such as the liver, chondrocytes and the blowfly salivary gland. In the 
intact liver, hepatocytes are extensively coupled by gap junctions, and large-scale 
repetitive calcium waves are observed when the organ is perfused with vasopressin 
[28, 27]. These waves initiate at a well-defined region of each liver lobule and propa- 
gate to a large number (>100) of cells. Intercellular coordination of Ca^~^ signals is 
also found in freshly isolated hepatocyte multiplets that retain some degree of gap- 
junctional coupling [33]. Multiplets respond with nearly synchronous oscillations 
to hormonal perfusion. Synchrony is disrupted, if the gap junctions are blocked. 
Under these conditions, stimulated cells are found to oscillate with widely differ- 
ent frequencies when uncoupled. To describe the Ca^“^ dynamics in hepatocyte 
multiplets, two kinds of mathematical models have been developed. The model of 
Dupont et al. [9] considers intercellular coordination by InsPa, accounting explic- 
itly for cytoplasmic and gap-junctional diffusion of InsP^ and cytoplasmic Ca^~^ 
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diffusion. Hofer [18] and Bindschadler and Sneyd [2] investigate the effects of junc- 
tional Ca^“^ diffusion. The latter two studies focus on gap-junctional coupling, and 
do not consider cytoplasmic Ca^“*“ diffusion. They show that due to the interplay 
of CICR and gap-junctional diffusion, cells with different intrinsic oscillation 
frequencies can actively synchronise. Recently, the stochastic versions of the deter- 
ministic models by Dupont et al. [9] and Hofer [18] were investigated by Gracheva 
et al. (2001). 

In this paper, we aim to extend the mathematical modelling from multiplets 
of two to three cells to larger cell arrays that are characteristic of the intact liver. 
The large range and repetitive nature of Ca^“^ waves in the liver indicate that a 
regenerative process is involved in wave propagation [28]. As in Hofer [18] and Bind- 
schadler and Sneyd [2], we assume that signal regeneration is due to CICR, and 
thus signal propagation primarily relies on gap-junctional Ca^“*“ diffusion. More- 
over, the cytoplasmic diffusion of Ca^"^ is strongly controlled by Ca^^ buffers [1] 
and may influence intercellular waves [36]. Therefore, we extend the previous stud- 
ies by explicitly including intracellular Ca^“^ diffusion. Furthermore, we account 
for the generation, and diffusion of the calcium-releasing messenger InsP;i. First, 
synchronisation in a pair of coupled non-identical cells is analysed. Specifically, we 
focus on how Ca^+ buffers can regulate the coordination of Ca^“^ signals. We find 
that the degree of intercellular coupling needed for synchronisation depends on the 
composition of cytoplasmic buffers, and, in particular, the diffusivity of buffers. 
Intercellular diffusion of InsP;^ is found to facilitate synchronisation. Second, peri- 
odic intercellular waves in a linear array of cells are studied. A central question is 
how coordinated waves arise in an ensemble of heterogeneous cells. We show that 
a local pacemaker can be sufficient to reproduce the unidirectional waves observed 
in the experimental systems. 



2. Model Description 

Calcium dynsimics. In the following a short description of the model will be given. 
A linear array of N connected cells of length L is considered. Let u(x, t), u(x, t) and 
w{x, t) be the Ca'^^ concentration in the cytosol, the Ca'^^ concentration in the 
main calcium store, the endoplasmic reticulum (ER), and the InsP;^ concentration, 
respectively; x is taken to be the space coordinate along the array. The changes 
in u at each point of the cell are given by the balance between the fluxes over the 
ER membrane, over the plasma membrane and the diffusion fluxes. Assuming a 
homogeneous distribution of the ER, the model equation for u reads in each cell: 

I = W. W, - J„) + A.g. (1) 

Jr is the ER release flux through the InsP^R, is the flux of cytosolic Ca^“*" 
through the SERCA pump into the stores, and Ji and Jo denote the inward and 
outward fluxes over the plasma membrane. Cu is the buffering capacity and Du is 
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the effective diffusion coefficient in the cytosol (see below). The time evolution of 
ER-calcium, v, is modelled by 



dv 

dt 



= ^{Js 

O-j; 



Jr)-j 



( 2 ) 



where a is the cytosol/ER volume ratio and Cy the buffering capacity in the ER. 
Due to the reticulated structure of the ER, the diffusion of Ca^+ is assumed to be 
negligible. 

The ER release flux is described according to Li and Rinzel [23] as 



Jr={k\(-— . , r) +fc2)(ti-u). (3) 

It includes the activation by InsPa, CICR, and the inhibiting effects of high con- 
centrations of on the InsPaR. The parameters k\ and /c 2 are the flux rate 

constants for the open InsPaR and the leak flux, respectively. is the disso- 
ciation constant of InsPa binding to the InsPaR, while Ka and Ki denote the 
dissociation constants of Ca^^ to putative activating and inhibiting binding sites 
of the InsPaR, respectively. The SERCA flux Jg is taken as [24] 



•A = Vs — 






( 4 ) 



with maximal flux Vg and dissociation constant of Ca^“^ to the SERCA pump Kg. 
The inward flux Ji over the plasma membrane is described by 



— '^0 + Vy 



R 



Ky + R 



( 5 ) 



It incorporates a small leak flux vo and a receptor-activated calcium entry with 
maximal flux Vy and half activation constant The parameter R is the di- 
mensionless concentration of hormone receptors occupied by agonist, the value of 
which will depend on the agonist dosis and total receptor concentration. The ac- 
tive transport of Ca*^^ out of the cell, by the plaisma membrane Ca'^“*“- AT Pa.se, is 
given by [4] 






(6) 



with maximal flux Vq and dissociation constant Kq. 



InsP;^ dynamics. The time evolution of the InsP .3 concentration, w, is modelled by 

= vp{R) - ky,w (7) 

where vp{R), ky, and Dyj are the velocity of production by PLC, the degradation 
rate and the cytosolic diffusion coefficient of InsP^, respectively. The activity of 
PLC depends on the active receptor concentration; we take vp{R) = vp-^p^. Note 
that in Eq. 7 the changes in w are assumed to be Ca^+-independent, so that Eq. 7 
is not coupled to Eq. 1. 
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Intercellular flux conditions. can diffuse through the gap junctions with the 

effective permeability Pu and InsP 3 with the permeability Py^. In the absence of 
electrical potential differences between cells, the driving force is the concentration 
difference across the gap-junctional connections. Let Uj denote the calcium con- 
centration in the j-th cell of the array. For Uj, the lefthand- and righthand-side 
intercellular fluxes are 



= Pu{uj{Q,t) - Uj-i{L,t)) 

x=0 

= Pu{uj+i{0,t) - Uj{L,t)), (8) 

x=L 

j = 2, • • • , — 1. Equivalent conditions can be formulated for w. At both ends of 

the array, zero-flux boundary conditions are chosen for u and w. 

Buffering. The effects of calcium buffers are included in the model via a rapid- 
equilibrium approximation, assuming that the buffer kinetics are fast compared 
to the reaction and diffusion terms. Provided that the buffers are not saturated 
by Ca^^, an effective Ca^+ diffusion coefficient, Dy, and an effective Ca^+ perme- 
ability, Pyj can be defined as [35, 19] 

M 

Dy + E D,B,/K, p 

Du^ and Pu = y^. (9) 

Dy (Py) is the diffusion coefficient (permeability) of Ca'^^ if no buffers were 
present, and Bi, Ki and Di denote the concentration, the dissociation constant 
and diffusion coefficient of the i-th buffer species {i = 1,...,M). The buffering 
capacity Cy is 

M 

= l + (10) 

2=1 



Dy 



dx 



D, 



duj 

dx 



Heterogeneity between cells. The Via vasopressin receptor is not evenly dis- 
tributed in the liver [27, 34]. The amount of hormone receptor is higher in the 
pericentral than in the periportal region. The heterogeneities in the amount of 
agonist receptor are modelled by assuming differences in the concentration of ac- 
tivated hormone receptors, R, between cells. 

Parameter values and numerical methods The diffusion coefficients and kinetic 
parameters are chosen according to Allbritton et al. (1992) and Hofer (1999), re- 
spectively (see Tab. 1). Solutions of the model equations are calculated using an 
implicit Euler method, with a time increment of 10 ms and a spatial discretisation 
of 20 grid points per cell. 
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Parameter 


Value 


Parameter 


Value 


a 


15 


Kc 


1.0 /iM 


fci 


40 s”^ 


Vo 


1.8 /iM/s 


k2 


0.02 s-i 


Ko 


0.12 /iM 


K, 


0.4 fiM 


Vp 


l.p /xM/s 


Ka 


0.4 ixM 


kyj 


0.1 s-i 




0.2 /iM 


Dyj 


225 /im^/s 


Vs 


9 /iM/s 


Du 


225 /iiri^/s 


Ks 


0.12 /iM 


Cu 


20 


VQ 


0.1 /iM/s 


Cu 


30 


Vc 


2.0 /iM/s 


L 


15 /xm 



Table 1 . List of parameter values 



3. Results 

Single-cell dynamics. An increase in agonist dose is equivalent to an increase in R. 
Taking as a bifurcation parameter, the stationary homogeneous Ca^^ distribu- 
tion looses stability for R > 0.352 and regains it for > 2.217; the cytosolic 
concentration at the steady states is 0.087 /iM and 0.257 /iM respectively. Between 
these two points the model shows sustained regular oscillations. The oscillation 
frequency is a function of R. The two points where the system loose stability are 
the two Hopf bifurcation points in the case of no spatial variation. 

Two coupled cells. First we consider two non-identical coupled cells and study the 
conditions under which the oscillations are synchronous. The uncoupled cells oscil- 
late with different intrinsic periods, T\ and T2. The diffusion of is sufficient 

to synchronise the oscillations. The synchronous oscillations have the same period 
and a constant phase relationship; the cell with the lower intrinsic period spikes 
first. In analogy to other systems of coupled oscillators [22] there exist a critical 
value for the intercellular permeability, P^’, above which the cells synchronise. A 
high degree of heterogeneity between the cells requires an increased gap- junctional 
connectivity to synchronise them. In Fig. 3 A the numerical estimation of P^ is plot- 
ted against the period ratio T1/P2 for different values of Du- The period ratio is a 
measure of how strong the cells differ; for T 1 /T 2 = 1 the cells are identical. 

The intracellular diffusivity of calcium influences the coupling of oscillating 
cells. In the range of experimental values for Du, Pu declines for a decreasing 
diffusion coefficient (Fig. 3A). This effect results from the regenerative nature of 
the intercellular signalling. At each new spike, Ca^“^ diffuses from the cell that 
spikes first in the neighbouring cell. To generate a spike in this cell, it is sufficient 
to induce a calcium release in the space adjacent to the gap junctions. The calcium 
increase then propagates as a calcium wave through the cell. 

Some analytical results on P^ can be obtained. For Du ^ 0, P^ shows a 
square root dependency on Du- Let ^ = xj\fD^ and k — P/y/D^ be a rescaled 
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Figure 1. Critical permeability as a function of the period 
ratio T\/T ‘2 (A) for Du = 27 (a), 18 (a), 9 (•), 4.5 /jm^/s (o), 
Pu, = Opm/s; and (B) for = 0 (a), 3 (a), 9 (•),15 /im/s 
(o), Du = 27 Ti and T 2 are the oscillation periods of the 

uncoupled cells. Only T 2 is varied. 



space variable and a rescaled intercellular permeability, where r is an arbitrary 
time constant. The length of the cell in the new variable ^ is A = Lj \JD uT. As 
Du — ^ 0, the influence of the boundaries can be neglected, because A 00 . For 
the rescaled system of equations, one obtains a critical permeability Therefore 
we have 

\im = lie ( 11 ) 

A similar result has been obtained for specific models of bistable wave propagation 
[37, 19]. 

The model permits to assess how the intercellular diffusion of InsP;^ affects 
the coordination of Ca^~^ signals. InsP;^ diffusion will bring the frequencies of the 
two cells closer together, by equalising the differences in InsP;^ concentration. In 
Fig. 3B, F/^‘ is plotted against T 1 /T 2 for different values of Puj. Although an in- 
crease in Pu, changes F/^ , the effects remain moderate. This is due to the fact that 
the extent of external Ca^"^ influx (Eq. 5), which also exerts a large control over 
the frequency (see [20, 8]), remains unchanged if InsPa diffuses. 

Injection of calcium bujffers. We have shown in the previous section that the co- 
ordination of coupled cells depends on the intracellular diffusion of calcium. The 
total concentration and composition of calcium buffers determines Du (Eq. 9), in 
particular the presence of diffusive buffers increases Du- A possible experimental 
procedure to change Du is the addition of diffusible cytosolic buffer (e. g. BAPTA, 
[36]). Besides modifying Du, external buffers change the kinetics of u (see Eqs. 
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1 and 10). Let Be, Ke, De be the concentration, the dissociation constant and 
diffusion coefficient for the added external buffer and define (5 = . The lat- 

ter measures the proportion of added buffer to internal buffers. For the parameter 
range considered, the amplitude and the frequency of the oscillations decrease with 
j3. The oscillations eventually cease if j3 reaches a critical value (Hopf-point). 

Injection of buffer will have different effects on the synchronisation of coupled 
cells depending on which cell is injected. On the one hand, injection of buffer in 
the cell with the lower intrinsic frequency will tend to desynchronise the cells 

increases with /3, Fig. 2A dashed line). On the other hand, injection in the 
cell with the higher intrinsic frequency will have the opposite effect (P^ decreases 
with /3, Fig. 2A full line). If buffer is injected in both cells the cells will tend to 
desynchronise (Fig. 2B). In all three cases the diffusivity of the injected buffer 
determines the strength of the effect. In Fig. 2C, a possible outcome of such an 
experiment is shown. Before addition of buffer the cells are synchronous {Pu = 
0.27 fim/s > P^, * in Fig. 2B). At time t = 1000 s buffer is added so that (3 = 0.2. 
After the addition the cells desynchronise (P^ < P^, -h in Fig. 2B). 

Periodic intercellular waves. Periodic intercellular waves have been described, for 
example, in weakly coupled oscillators and coupled integrate and fire oscillators 
(e.g. [11, 12, 3]). In our case, as in previous models, the spatial distribution of 
intrinsic frequencies is an important parameter. It will affect, among other things, 
the direction and speed of the intercellular waves. In hepatocytes multiplets and 
in the blowfly gland, the frequencies of the uncoupled cells differ of about a factor 
two [33, 38]. 

In the intact systems (i.e. blowfly gland, liver), the conditions under which 
long-range coordination arises remain to be elucidated. We studied whether a local 
pacemaker can organise unidirectional waves in an array of heterogeneous cells. A 
realisation of such an arrangement is shown in Fig. 3. The intrinsic frequencies of 
the cells vary according to the activated agonist receptor concentration, R. Fig. 3A 
depicts the frequency distribution of the uncoupled cells. The P- values have been 
drawn from of a normal distribution, resulting in a frequency of 15.8±1 mHz 
(frequency histogramm. Fig. 3B). The leftmost cell is assigned a higher P- value, 
and therefore has a significantly higher frequency than the other cells (19.6 mHz, 
* in Fig. 3B). 

The oscillations of the uncoupled cells are not coordinated (Fig. 3C), as 
experimentally observed under perfusion with gap junction blockers [38]. If the cells 
are coupled, with a Ca^"^ permeability that is in the range of the one determined 
for two coupled cells, the oscillations synchronise. First, intercellular waves initiate 
from different sites. The multiple initiation sites disappear after a few cycles, and 
waves originating at the leftmost cell travelling along the array develop (Fig. 3D). 
The leftmost cell plays the role of a pacemaker. These repetitive waves show an 
approximately constant wave speed. This has indeed been reported experimentally 
[28, 38]. For the simulation in Fig. 3 we get a wave speed of 2.62=b0.3 /xm/s, 
an intracellular delay of 0.2T0.1 s and a gap-junctional delay of 5.3T0.8 s. For 
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Figure 2. Changes in the critical permeabilty after injection of 
buffer. (A) Injection in the cell with the higher {R = 0.65, full line) 
or lower {R = 0.525, dashed line) intrinsic frequency. (B) Same as 
(A) but injection in both cells. (C) oscillations of cytosolic Ca^”*" 
in two coupled cells, R values as in (A), Pu = 0.27//m/s; at time 
t = 1000s buffer (with = 112.5 /xm'^/s) is added, j3 changes 
from 0 to 0.2 (* and + in B). For all figures Du = 11.25 /im'^/s. 
The diffusion coefficient of the injected buffer is = 0 (A), 11.25 
(A), 22.5 (•), 56.25 (o), 112.5/xmVs (♦). 



permeabilities that are not sufficiently high to completely synchronise the cells, 
preliminary results indicate that multiple initiation sites can persist. 



4. Discussion 

The presented mathematical model is aimed at describing the coordination of 
Ca^^ oscillations in multicellular systems, such as the liver or the blowfly gland. 
In these systems, large-range intercellular waves are observed, if the cells are gap- 
junctionally coupled and stimulated with InsP^ mobilising agonists. The model 
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Figure 3. Periodic intercellular calcium waves in an array of het- 
erogeneous cells. (A) Intrinsic frequencies of the uncoupled cells. 
The frequency varies according to the activated agonist receptor 
in each cell, drawn from a normal distribution {R = 0.6 ± 0.05). 
Cell 1 is assigned a higher value {R = 0.8). (B) Frequency his- 
togram. (C) Space time plot of the cytosolic calcium concentration 
if the cells are uncoupled {Pu = Pw = 0/im/s); (D) if the cells 
are coupled {P^ = 0.6/xm/s, Pw = 1.5/im/s). Du = 22.5 //m^/s, 
V{) = 0.05 /iM/s, the other parameters as in Table 1. (For colored 
picture see color plate 2) 
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mechanism relies on the intercellular diffusion of Ca^“^ through gap junctions and 
on the regenerative capability of the signal by CICR. Cells are assumed to be 
heterogeneous in their agonist receptor concentration and so oscillate, if uncoupled, 
with different frequencies. Theoretically other factors (e.g. cell volume, size of the 
Ca^~^ -stores), may cause heterogeneities in the cell population. In agreement with 
previous studies [18], first investigations show that the results will qualitatively 
remain the same. 

To synchronise the oscillations of coupled cells, a minimal value for the gap- 
junctional permeability of Ca^“^ is required. It is found to be in the range of exper- 
imentally reported values [10]. The critical permeability depends on the degree of 
heterogeneity and on the properties of the cytosolic compartment. In particular, 
the intracellular Ca^+ diffusivity can affect the intercellular coordination. This is 
due to the fact that local Ca^"*" elevation near the gap junctions may trigger CICR 
(provided that Ca^“^ release sites are located in the vicinity of the gap junctions) 
and trigger a Ca^”^ wave propagating through the entire cell. Thus, the model 
predicts that injection of Ca^^ buffers can desynchronise coupled cells, whereby 
high diffusible buffers will be more effective. The repetitive intercellular waves, 
observed experimentally, are qualitatively reproduced by the model. We showed 
that a pacemaker in a cell array can be sufficient to organise unidirectional waves 
with approximately constant speed. Another hypothesis for the wave generation is 
a gradient in agonist receptor concentration along the cell array [27, 34]. However, 
our preliminary results indicate that in this case it is difficult to obtain waves with 
(constant speed. This problem requires further study. 

In our model, InsP.^ is a control variable that sets the sensitivity of the 
InsP.^R to CICR but does not itself oscillate. Therefore, the intercellular diffusion 
of InsP;i will not coordinate the oscillations in adjacent cells. However, recent 
experimental data show that InsP;^ can oscillate concomitantly with Ca^~^ [17, 26], 
and that InsP;i buffers can abolish Ca^^ oscillations (P. Burnett, S. Joseph and 
A.P. Thomas, submitted manuscript). Several mechanisms may account for InsP.^ 
oscillations, including Ca^+ activation of PLC, and Ca^^ feedback on G-protein 
activation [25, 6, 21, 31]. This indicates that, at least in some cell types, InsP.^ can 
have an active role in the intercellular coordination of Ca^^^ oscillations. 
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1. Introduction 

This year marks the fiftieth anniversary of Turing’s famous paper [10] where he pre- 
dicted that instabilities could result in otherwise stable two-species systems when 
diffusion was added to the system, so-called diffusion-driven instabilities. This idea 
has later been investigated more profoundly by Meinhardt and Gierer [3]. They 
proposed that diffusion with long-range inhibition and local self- activation repre- 
sents a basic mechanism for pattern formation in biology. Do the ideas originating 
in Turing’s paper and further developed by Meinhardt and others apply to dis- 
crete models as well? If this is not the case, what kind of mechanisms cause pattern 
formation in discrete models? 

By a discrete model we mean a model of separate, identical cells fixed in a 
lattice, where the development of each cell depends on its internal dynamics and 
its interaction with other cells in the lattice, usually the immediate neighbours. In 
[7] we distuinguished between diffusion driven (DD) models and signalling driven 
(SD) models. In that paper, where regreatably Meinhardt’s contributions were 
not mentioned, it was found that his ideas carry over to discrete DD models, but 
that the properties of the homogeneous state only predicts, not explains, the final 
pattern. For discrete SD models the ideas of Meinhardt do not seem applicable. 
There is no diffusion and no diffusion rates, and all interaction is between nearest 
neighbours. The interaction mechanism cannot be separated from the internal 
dynamics. To “turn off” the interaction would mean to disrupt the whole model. 
Alternative pattern generating mechanisms must be sought. Thus, in the present 
paper we are not only concerned about which patterns can be formed in discrete 
models, but how^ by which mechanisms, patterns can be generated. 



2. The Method 

We have applied our ideas to the discrete Fisher equation, the Delta-Notch model 
of Collier et al. [1], the juxtacrine signalling model of Owen and Sherratt [6] and DD 
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models with activator-inhibitor internal dynamics. The Fisher equation is a one- 
species model and apparently does not have patterned stable states. Nevertheless 
we have found it an extremely useful playground and testbed in the elaboration 
of our methods. 

Let the identical cells Cj be arranged in an infinite or semi-infinite string, 
and let Xj G be the state of Cj , expressing the concentration of n species char- 
acterising the state of the cell. A discrete model with nearest neighbour interaction 
is the lattice differential equation (LDE) 

Xj = + (1) 

where j ^ 7 a or j ^ N. We only consider models where /(x, y, z) = f{z, y, x). 

A travelling wave solution of Eq. (1) is commonly defined as a solution of the 

form 

Xj{t) = 4>{j - ct), c^O, (2) 

where c is the wave velocity and cj) expresses the shape of the moving wavefront. 
We assume c > 0. This definition is convenient for one-species models when the 
wave carries the cells from one common state ahead of the wavefront to another 
common state behind the wavefront. In multispecies models, the typical case is 
that the cells far ahead of the wavefront occupy a homogeneous, unstable state, 
while the cells behind the wavefront approach a stable periodic or almost periodic 
spatial pattern. 

Assume a stable, periodic patterned state exists and let m be the period of the 
pattern. Then Eq. (2) should be replaced by the translation invariance condition 

~ Xj{t — T), (3) 

where T = m/c is the time needed for the wave to pass m cells. With these 
assumptions Eq. (1) can be reduced to m equations for the m cells in a pattern 
period, 

Xk{t) = f{xk-l,x^,.,Xk+\), /c = l,...,m mod m. (4) 

For k = 1 the argument of is ^ - T, and for k = m the argument of x\ 
is f + T, otherwise the arguments are t. Thus, this is a set of so-called mixed 
functional differential equations. Very few general results on this type of equations 
exist [8, 9]. 

To analyse Eq. (4) we assume the existence of an initial homogeneous state 
and linearise around this state. This leads to V = MY, where Y is an mn- 
component state vector composed of the deviations of Xk from the homogeneous 
state. Using a generalisation of the reduction method of Othmer and Scriven [5, 7], 
the eigenvalue equation for M can be reduced to m eigenvalue equations expressed 
in terms of certain nxn matrices corresponding to internal and intercellular inter- 
actions. The eigenvalue equations are transcendental due to exponentials stemming 
from the periodicity condition, and the wavespeed c appears as a free parameter. 

The method can in principle be applied to any discrete system, but there 
are some serious limitations. Numerical simulations on DD systems reveal that 
generally, the dominant wave generates a pattern that is only almost periodic. For 
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this reason we have found it necessary to refine the concepts. A travelling wave 
which satisfies Eq. (3) exactly will be called a translation invariant travelling wave 
(TITW). TITWs generate periodic patterns or no pattern on infinite strings when 
no boundary effects are present. A propagating front which only satisfies Eq. (3) 
approximately, can still be called a travelling wave. We will refer to a travelling 
wave of this type as a quasi translation invariant travelling wave (QTITW). Both 
types of wave usually have an oscillating wavefront. Typically a QTITW seems to 
generate a periodic pattern, but the slightly out of phase oscillations eventually 
make a few cells break the pattern. Then it seems that the phase is reset, but 
not exactly, and further along the string another couple of cells break the pattern, 
creating erratic irregularities in the otherwise periodic pattern. We have adapted 
a second approach from the basic method to cope with these cases. 

3. Results 

3.1. General Results 

We have shown that quite independent of the dynamics of the system, the only 
patterns that can be generated by a TITW without oscillations in time, is the 
alternating, period two pattern or no pattern at all (m = 1). In all other cases the 
matrices determining the eigenvalues are complex and have complex eigenvalues, 
leading to temporal oscillations. 

The Discrete Fisher Equation 

The discrete single species equation 

Xj = g{xj) -h d{xj^x - 2xj + Xj>i), d > 0, (5) 

where g{x) = ax{l — x) or a function with a similar shape, can be considered as a 
discretisation of the continuous Fisher equation or as a discrete one-species model 
of interacting cells in its own right. Zinner et al. [11] proved that for c larger than a 
certain minimum speed c*, a travelling wave solution with a monotone wavefront 
and wavespeed c exists, carrying the system from the unstable state xj = 0 at 
t = — oo to the stable state Xj = 1 at ^ = -fcxo. 

Numerical simulations on a long, but finite string show that a positive pertur- 
bation of the state x i = 0 quickly develops into a travelling wave with a monotone 
wavefront and a speed which is close to the theoretical minimum speed c* . Zinner 
et al. did not prove that the dominant wave moves with the minimum speed, while 
Kolmogorov et al. have proved this for the continuous case [2]. 

Linearising Eq. (5) and solving the eigenvalue equation for m = 1 (no pattern) 
and A real, we found a dispersion relation A = A(c) which is exactly equivalent 
to the condition of Zinner et al. for the existence of travelling waves of Eq. (5), 
and which shows that travelling waves exist for c > c*, only. To determine the 
wavespeed of the dominant wave we performed a similar linearisation around the 
final state Xj = 1, and by combining the results of the two linearisations we were 
able to show that the dominant wave travels with the minimum speed c*. We 
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attribute the discrepancy of a few percent between c* and the speed computed by 
numerical simulations to boundary effects on the finite string. 

When the discrete Fisher equation is considered as a discretisation of its 
continuous counterpart over a string of finite length L into N cells of width L/N, 
we found that the speed of the dominant wave is always smaller in the discrete 
system than in the continuous system, and that they become equal in the limit 
N oo. 

The complete eigenvalue spectrum of M consists of an infinite sequence of 
complex eigenvalues with arbitrarily large positive and negative real parts. Why 
are the oscillating solutions corresponding to the complex eigenvalues not seen in 
simulations on the nonlinear equations? In fact, they are. If the travelling wave 
is initiated with only positive perturbations of the first few cells, a wave with 
a monotone wavefront and speed c* quickly develops, carrying Xj towards 1. If 
the initial perturbations are negative, a negative travelling wave develops, there 
are no oscillations, and all Xj diverge quickly towards — oo. However, if the initial 
perturbations have both signs, some of the cells oscillate for a short while, and then 
all Xj start moving towards either 1 or —oo as in the first two cases. A complete 
analysis of oscillating solutions of the linearised equations and their relation to the 
nonlinear equations remains to be done. 

3.2. Signalling-Driven Models 

In this section we discuss the model for Delta-Notch lateral inhibition by Collier 
et al. [1] and the model for juxtracrine signalling by Owen and Sherratt [6]. 

Notch is a transmembrane protein which seems to be involved in a large 
number of patterning processes. There is evidence that Notch in a cell binds to 
Delta anchored on the membrane of the immediate neigbouring cells. Let Dj and 
Nj represent the levels of Delta and Notch activity in the cell Cj , respectively. In 
suitable scaled variables the model of Collier et al. is 

Dj=g{l-S{N„dN,p)-Dj), 

N,=S{{D)j,en,p)-N„ 

where ^ is a positive parameter, {D) j = (D^-i +Dj+i)/2, 5 is a sigmoid function, 
p its steepness parameter, and Op and ^yv the thresholds of Delta and Notch 
activity Thus, the bctsic assumption of the model is that expression of Notch in 
a cell inhibits the expression of Delta in the same cell, while expression of Delta 
activates the expression of Notch in the neighbouring cells. 

Already in the original paper there were indications of travelling waves in 
the model. These indications were supported in a recent analysis [7]. No proof of 
the existence of travelling waves in the DN system exists, but numerical simula- 
tions show a dominant TITW with a non-oscillating wavefront which generates an 
extremely robust period two pattern. Hence, we have limited ourselves to investi- 
gating non-oscillating travelling waves. 

Our main theoretical results on this model are the following: Many irregular 
patterns can result from random initial perturbations, but only the alternating 
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pattern can be generated by a travelling wave. This wave is the dominant wave in 
the sense that it diverges faster from the homogeneous state than any other non- 
oscillating wave. The calculated wavefront profile of the dominant wave agrees 
with the observed, near the initial state as well as near the final state, and its 
speed is the minumum possible speed c*. Finally, we have shown analytically that 
if S is the Hill function, c* is very close to being a linear function of p, even for 
biologically realistic values, and approaches a linear function asymptotically. 

In the model for juxtacrine signalling of Owen and Sherratt [6] there is also a 
clear indication of a pattern-generating travelling wave (see their Fig. 8). For free 
and bound receptor levels the period eight pattern is characterised by pronounced 
spikes separated by seven roughly equal values. Using their parameter values we 
have taken a close look at this wavelike (numerical) solution. Beyond any reason- 
able doubt the motion is a TITW, despite the fact that the motion of each variable 
is extremely intricate, with small, irregular oscillations before it sets out towards 
its final value. 

Even if the wave really is a TITW, it is not obvious that the linear methods 
described in the present paper can explain the motion close to the initial state. 
One reason may be that the rapid rise of the spikes influences the motion of the 
cells many period lengths ahead of the wavefront. Also, the linear approximation 
may be too crude due to the very high spikes. These problems point to possible 
limitations in the validity of linear approximations in travelling wave analysis. 

3.3. DD Models with Activator-Inhibitor Dynamics 

For two-species DD models with internal activator-inhibitor dynamics, Eq. (1) is 

Xj = g(xj) + D{xj_i - 2xj + Xj+i), (7) 

where g is the activator-inhibitor mechanism and D contains the diffusion rates 
(activator) and dj (inhibitor) on the diagonal. Combined analytical and numerical 
investigations have revealed that in general TITW solutions do not exist for this 
type of model. 

As an illustration we consider the Thomas system with parameter values as 
given in [4], Chapter 14, and two sets of diffusion rates; dA = 0.01, dj = 0.40 (sim- 
ulation 1) and = 0.01, dj = 0.60 (simulation 2). Both sets satisfy the Turing 
instability conditions [7] . In both simulations a perturbation in the leftmost cell in 
the lattice initiates a travelling wave propagating through the lattice and gener- 
ating an alternating up-down pattern. In simulation 1 the wavefront is monotone 
and the up-down pattern persists through the entire lattice. The solution appears 
to be a TITW. In simulation 2 the wavefront is oscillating with increasing ampli- 
tude until the up-down pattern is broken (Fig. 1). Note the irregularities in the 
alternating up-down pattern (Fig. 2), showing that the solution is a QTITW. 

Extensive investigations of other diffusion rates and Gierer-Meinhardt and 
Schnakenberg internal dynamics show that a wide range of patterns can be formed, 
but that the patterns are in general quasi-periodic and generated by a QTITW. 
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d^=0.01 . d, =0.60 




Figure 1. Wavefronts of activator solutions in cells in a lattice 
consisting of 50 cells assuming activator-inhibitor dynamics with 
Thomas type kinetics and diffusion rates dA = 0.01, dj = 0.40 
(left) and = 0.01, dj = 0.60 (right). The leftmost and right- 
most cells are not included to leave out boundary effects. 




Figure 2. Steady state final pattern of the activator solution 
on a lattice consisting of 100 cells assuming activator-inhibitor 
dynamics with Thomas type kinetics and diffusion rates = 

0.01, dj =0.60. 

We suggest that the simulation results may be explained by modifying our 
general method in the following way. We linearise Eq. (7) around the homogeneous 
state and assume solutions of the form 

Uj{t) = v{a) exp{—a{j — ct) — ib{a){j — ct)), ( 8 ) 

where a is a free, real parameter. The eigenvalue equation of M relates the eigenval- 
ues A(a) = ii{a) -\-iiy{a) to a and b{a). Our main interest lies in the dominant wave. 
For each value of a we seek the value of 0 < 5 < tt that maximises p. We conjecture 
that the dominant wave has wave speed c* = min c(a), where c(a) = ij,{a)/a, and 

a>() 

look for solutions that satisfy Eq. (3). This leads to the requirement = k/m 
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where k is an integer and 

27T 27TC* 

Thus, must be rational with the pattern period m in the denominator, otherwise 
the solution is not a TITW, but a QTITW. If the latter is the case, an approximate 
pattern period may be estimated by finding a rational number that is close to 

For the parameter values in simulation 1, is exactly equal to 1/2, i.e. the 
linear analysis predicts that a period two pattern is generated by a TITW. For 
the parameter values of simulation 2, 0.476, which is close to 1/2. Thus, 

we expect a period two pattern with irregularities. In fact, 10/21 = 0.47619 
0.476, suggesting that m = 21 is an approximate pattern period. This is in rough 
agreement with Fig. 2. 



Conclusions 

In the present paper we have developed and applied linear methods to analyse 
pattern generating, propagating solutions in one-dimensional lattices of interacting 
cells. These are our main findings: 

Travelling waves seem to be a generic feature in discrete cell lattices, implying 
that signals can be propagated to any distance in the lattice. Translation invariant 
travelling waves generate periodic spatial patterns, while quasi translation invari- 
ant waves generate almost periodic patterns, where the periodicity is broken at 
irregular intervals. In diffusion-driven models, the latter seems to be the rule. 

If the wave has a monotonic wavefront, only a single periodic pattern, the 
alternating period two pattern, or a homogeneous final state can be generated. 
Waves with an oscillating wavefront can be translation invariant or not. 

It seems to be a rule that among all possible travelling waves, the dominant 
wave travels with the minimum possible speed. 

Patterns generated by travelling waves are robust. By random perturbations 
of the initial, homogeneous state, a large number of different patterns can usually 
be obtained. Contrary to this, the dominant, translation invariant travelling wave 
generates only a single pattern which is propagated all over the lattice. Diffusion- 
driven models possess this robustness property only approximately. 

One can imagine that in a biological organism, the generation of a spatial 
pattern in an inherently homogeneous piece of tissue could be set off either by 
processes initiated more or less simultaneously all over the lattice, or only in a 
(small) part of the tissue. In the latter case a wave may be initiated, sweeping 
all over the tissue, and generating a robust periodic or quasi periodic pattern. 
Thus, it seems that in tissues where a strictly periodic pattern is generated, one 
should look for a signalling driven, wave-generating mechanism. If the pattern is 
only almost periodic, a diffusion-driven, wave- generating model may still be more 
appropriate than a model with a Turing-like pattern generating mechanism. 
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Despite certain limitations to its applicability, linear analysis combined with 
numerical simulations yield valuable and some times exact information on travel- 
ling waves on non-linear multicellular systems. 
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1. Introduction 

In his seminal book Biochemical Systems Analysis: A Study of Function and Design 
in Molecular Biology^ Savageau (1976) proposed an agenda toward a true under- 
standing of complex networks in biology. In addition to developing mathematical 
models of specific phenomena, he suggested that more general and far-reaching in- 
sight could be gained from studying the generic designs of systems, such as mecha- 
nisms of feedback inhibition by end product, which were showing up in nature time 
and again. In an editorial for The New Biologisf Savageau (1991) illustrated the 
rationale for investigating designs with an instructive demonstration of the differ- 
ent levels of understanding that can be obtained with standard experimentation. 
He compared the quest for biological insight with different types of analyses of a 
television set. At the lowest level of understanding, the features of the TV set are 
simply described in terms of color, size, and shape. This description can extend 
to internal parts like the picture tube and is easily made quantitative. Clearly, 
science by description parallels the early beginnings of biology, where plants and 
animals were qualitatively, and later quantitatively, described and classified. Sav- 
ageau identified as the next higher level the isolation and analysis of components. 
Each interesting-looking component may be analyzed in an electronics lab, and if 
it consists of a number of parts itself, one could disassemble the components and 
study each part. This strategy is called reductionism and has been the dominant 
paradigm of biology in the twentieth century. In some sense, the completion of the 
human and other genome projects is the Holy Grail of this approach, because the 
genome is considered the blue print that implicitly contains all information needed 
for the functioning of an organism in its environment. While reductionism has been 
extremely successful and will continue to yield enormous amounts of new insights, 
returning to the TV set analogy shows that it alone is not sufficient: Even the 
complete collection of parts is not sufficient to understand how the TV set works. 
To infer the role of a particular component, this component may be taken out or 
altered. Our analogous experience with knockout animals attests to the fact that 
this strategy of component alteration is sometimes, but not always, successful. If 
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a particular gene is solely responsible for a particular function, then this func- 
tion changes or ceases with a destructive mutation of the gene. However, if the 
function is crucial to the organisms survival under normal conditions, the organ- 
ism seldom relies on just one gene or one mechanism and instead has alternative 
means of coping with the loss of a single gene function. Thus, knocking out this 
gene may primarily have no visible effect, even though the gene is an important 
component for normal functioning. The highest level of understanding comes from 
what Savageau termed design principles. These general rules provide a rationale 
for why a process or system is composed of certain components and is regulated in 
a certain fashion. In a detailed analysis, Savageau (1976) demonstrated this type 
of explanatory analysis: he explored the advantages of an end product inhibiting 
the first rather than a later step in a linear chain of reactions, thereby providing 
an explanation for why this type of inhibition seems to dominate natural systems 
(see also [1]). 



2. Design Principles 

The design of its internal systems constitutes the basis for the normal functioning 
of an organism. It is often surmised that evolution has optimized these designs 
and that small variations in structure typically lead to inferior performance. Well- 
documented examples are the pentose phosphate pathway and the TCA cycle. 
Spearheading analyses of these crucial pathways, Melendez-Hevia and cowork- 
ers (e.g., [8]) used different mathematical approaches, including game theory and 
optimization, to demonstrate that the pentose phosphate pathway cannot be im- 
proved with alterations that are readily available to the organism. Similarly, Mit- 
tenthal et al. [9] suggested the optimality of the Krebs cycle. Heinrich’s group 
(e.^., [3]) analyzed in great detail the functionality and optimality of single en- 
zymes of small multi-enzyme systems. Savageau and coworkers executed rigorous 
comparative analyses of certain aspects of the immune response (e.^., [7]) and 
of gene regulation under different environmental conditions (e.^., Hlavacek and 
Savageau, [4]). The insights from the latter investigations led to an explanatory 
framework of gene regulation, called Demand Theory {e.g., [17]), which is now sup- 
ported by numerous examples of microbial gene regulation. The design of a system 
provides the boundaries for what an organism is able to accomplish. If the produc- 
tion of a metabolite is not regulated, the serendipitous exogenous availability of 
the metabolite is not of much use. By contrast, if the pathway is designed such that 
any exogenous supply is reported back to the initial steps of production, thereby 
signaling reduction in synthetic activity, the saved energy can be used for other 
purposes, thus giving the organism an advantage. While a superior design is crucial 
for the organisms fitness, it must be accompanied with optimized guidelines for 
operation. As a simple example, suppose the organism needs to increase the con- 
centration of a metabolite. This could be accomplished by increasing production, 
decreasing degradation, inducing an activator, removing a repressor, up-regulating 
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a gene, or by combining different such means. Obviously, there are ample examples 
for all of these strategies in nature. In order to understand nature’s choice in a 
given situation, it is necessary to provide an objective rationale for this particular 
choice. In analogy with the exploration of design principles, it thus seems prudent 
also to search for general operating principles. A true understanding of the design 
and operation of a metabolic system is not purely an academic exercise, but has 
great practical potential. In a biotechnological setting, metabolic engineers are 
close to designing new organisms or to import new pathways from one organism 
into another. As a remarkable example, Zaslavskaia et al. [27] recently converted 
an obligate photoautotrophic organism into a light-independent organism through 
the introduction of a human gene coding for a glucose transporter. This alteration 
fundamentally changes the metabolic constitution of the organism and opens en- 
tirely new possibilities, such as the greatly simplified use of algae for fermentation 
processes. As it becomes easier to create new function in microbial organisms and 
cell lines, it becomes more and more crucial to understand how to control this 
function. 



3. Operating Principles 

Top-Down Approach. The search for operating principles may be approached from 
the top down or the bottom up. The top-down strategy attempts to explain why 
an observed integrated system is being operated in a particular fashion under 
given conditions. This naturally occurring operation is not always intuitive. As an 
illustration, consider the differential up-regulation of glycolytic enzymes in yeast 
upon heat shock, as it is illustrated in Fig. 3 (see [25] for details). The observed 
degrees of up-regulation (http://rana.lbl.gov/) are still subject to relatively large 
experimental error. Nonetheless, it is quite evident that they differ tremendously 
among the various glycolytic steps. For instance, 15 min after heat shock, the 
conversion of glucose into glucose 6-phosphate is about five to ten times more 
active than under the former, cooler temperature of 27C. In contrast, the next 
key step of glycolysis, the phosphofructokinase reaction, is essentially unchanged. 
Later steps toward pyruvate and ethanol are slightly up-regulated by about two to 
three times. These observations are interesting for three reasons. First, they are not 
immediately intuitive, and one would presumably have expected a different profile 
in which, for instance, just the first (rate-limiting ?) step had been up- regulated, 
or in which all genes had been up-regulated to the same degree. Second, our 
preliminary analysis [25] provides support for the non-intuitive expression pattern. 
Thus, the particular type of natural operation of the pathway is apparently not 
mere coincidence. Third, and maybe most important for future analyses of genome 
data, today’s premier strategy of genome analysis is the clustering of genes by 
degrees of up- or down- regulation upon a stimulus {e.g.., Quackenbush, [10]). Genes 
exhibiting a high deviation from their baseline expression level are considered very 
much involved in the response, while genes not showing much change are considered 
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independent and detached from the response. In our heat shock example, the 
glycolytic genes would have ended up in three or four different classes, and one 
would have concluded that they had not much to do with each other. As a case in 
point, we know full well that phosphofructokinase is a key step of glycolysis, yet 
clustering would not have identified this step. Similar misinterpretation is to be 
expected for genes coding for signaling proteins like hormones or G-proteins that 
have a high gain in a sense that small variations may have significant consequences. 
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Figure 1. Simplified representation of glycolysis and of diverg- 
ing pathways in the yeast Saccharomyces cerevisiae. The sizes of 
arrows reflect the magnitudes of metabolic fluxes. ‘Told” changes 
refer to degrees of up-regulation of the corresponding genes ten 
minutes after a rise in temperature from 27° C to 37° C. See text 
for further details. 

As a second example for a top-down investigation, we recently took a closer 
look at the genomic and biochemical responses of the trehalose cycle in yeast 
upon heat shock (Fig. 3). Trehalose is a disaccharide that aids in the thermo- 
stabilization of proteins. If the temperature of the yeast culture rises to 37° C or 




Keys to Understanding Biological Systems 



123 



higher, large amounts of trehalose must be produced in a short period of time. 
In fact, within less than an hour, the trehalose concentration may increase from 
almost undetectable traces to one gram per gram protein [5]. The structural de- 
sign of the trehalose cycle is primarily dictated by the biochemical necessities of 
creating the disaccharide from glucose, even though an alternative does exist in 
the form of the maltose pathway [2]. In contrast to the rather straightforward 
flux design, the operation of the trehalose cycle is not intuitive. For instance, if 
trehalose is needed fast, why does glucose inhibit the production of the precursor 
to trehalose, trehalose 6-phosphate? Also, if the goal is rapid production of tre- 
halose, why is the enzyme trehalase, which degrades trehalose to glucose, strongly 
up-regulated (http://rana.lbl.gov/)? Why are genes up-regulated that code for 
enzymes of glycogen synthesis, even though this synthesis competes with the pro- 
duction of trehalose? We are in the process of analyzing these questions in some 
detail by studying representative and relevant scenarios, such as operation un- 
der normal or increased temperature and under normal, elevated, and low glucose 
conditions [26]. While not providing proofs of mathematical rigor, the results of 
these investigations provide reasonable explanations, which in turn may be used 
as hypotheses for experimental verification. For instance, it appears that the in- 
creased activity of trehalase is necessary for dynamic reasons. To generate large 
amounts of trehalose, the cell has two options. It could moderately increase the 
activity of the enzymes of trehalose synthesis and continue in this fashion until 
the desired level of trehalose is reached. Instead, the cell strongly up-regulates the 
genes responsible for trehalose synthesis and also the genes of trehalose degrada- 
tion, though to a lesser degree. Dynamic analysis suggests that the second option, 
while more costly in terms of gene regulation, transcription and translation, leads 
to the desired amounts of trehalose in a considerably shorter period of time. With 
the parameter values in our analysis, the first strategy would take about 30 min- 
utes to reach the required trehalose level, while the second, apparently wasteful, 
strategy takes only about five minutes. Under heat shock conditions, time is of the 
essence, lest proteins begin to degrade, and this seems to dictate the cell’s modus 
operandi. 

Bottom- Up Approach. This approach investigates simple generic systems with 
the goal of a full and general understanding. For instance, it may focus on a 
reversible linear pathway that responds to an environmental stimulus and ask 
which steps of the pathway should be up- or down-regulated to what degree in 
order to achieve an optimal response, where optimality may include speed of end 
product availability, minimized accumulation of unwanted intermediates, and cost 
of using the regulatory machinery. 

As an example for a bottom-up analysis, we recently analyzed a simple se- 
quence of two branches, as shown in Figure 3 [24]. The goal of this analysis was to 
establish optimal operation guidelines under the assumption that product X 4 was 
to be maximized. Obviously, overexpressing all steps would lead to more and more 
product, so in order to obtain fair comparisons, the total expression effort was kept 
constant. Thus, it was assumed that either only one step could be quadrupled, that 




124 



E.O. Voit 







a 

Figure 2. Simplified representation of the trehalose cycle in Sac- 
charomyces cerevisiae. The sizes of arrows reflect the magnitudes 
of metabolic fluxes. The solid arrow indicates repression of tre- 
halose 6-phosphate synthesis by internal glucose. See text for de- 
tails. 



two steps could be doubled, or that one step could be doubled and another step 
repressed to half. 

The main result of this analysis was the following interesting insight: The 
optimal strategy depends on the regulatory design of the system. If the system is 
totally unregulated, several strategies lead to the same output of product X4. For 
instance, it is immaterial how the two branches at node X[ or X2 are over- and 
under-expressed, respectively, as long as the so-called flux-split ratio is the same. 
Thus, doubling the branch V2\ toward X4 and halving the competing branch U23 
leads to the same production of X\ as solely quadrupling V24. This result is in 
line with strategies currently suggested in metabolic engineering (e.g., [19] p. 427), 
which call for the identification of principal nodes and a targeted alteration of 
flux-splits at these nodes. The situation changes if the pathway is regulated. If X[ 
exerts feedback upon Uoi, halving v\o and doubling V\2 is inferior to quadrupling 
Vi2- If both X\ and X2 exert inhibition, as indicated in Fig. 3, doubling v\2 and 
V24 is superior to most other alternatives, but the best strategy for maximizing X4 
is actually the sole four-fold over-expression of 024^ Thus, as soon as regulation is 
involved in a metabolic network, a limited focus on flux-split ratios at principal 
nodes is no longer sufficient. 



4. Methods Needed for the Exploration of Design and Operating 
Principles 

The key strategy for exploring the role of a feature or process is the comparison 
between a system that shows this feature and an otherwise equivalent system that 
does not have the feature. By carefully testing the two alternative systems against a 
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Figure 3. Hypothetical pathway used for a bottom-up analysis 
of operating principles. The dotted arrows with question marks 
indicate that these feedback inhibition loops may or may not exist. 
Depending on their existence, different operating strategies of up- 
regulation lead to maximal synthesis of the desired product X.\. 

set of criteria of functional effectiveness, it is possible to arrive at objective reasons 
for the advantages or disadvantages of a particular design or strategy of operation 
(e.g., [7]). To allow for such close comparisons, a mathematical framework is needed 
that satisfies three criteria. 

1. The models derived from the mathematical framework must be rich enough 
to capture all relevant phenomena that one might encounter in a biological 
system. Such phenomena include various types of nonlinearities, satura- 
tion, and oscillations. 

2. The describing equations must be of a form that facilitates successive 
extensions to more complex scenarios. For instance, if the inclusion of 
some inhibitory effect requires significant structural and logistic changes 
in a model, it is difficult to compare objectively two alternative systems, 
one with and the other without the inhibition in question. 

3. It is beneficial if there is a one-to-one mapping between the features of 
the biological system and the mathematical equations. As an example to 
the contrary, suppose a biological process is described with a fourth-order 
polynomial. As soon as the process is slightly altered, chances are that all 
coefficients of the polynomial will be affected. By contrast, comparisons 
are greatly facilitated if some kinetic property is uniquely captured by a 
single parameter and if setting the parameter value to zero corresponds 
directly to removing the kinetic property. 
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Experience over the past three decades has shown the Biochemical Systems Theory 
(BST; [11,12]) is an excellent candidate for executing comparisons of alternative 
systems. Detailed descriptions of this modeling framework and its applications 
were presented in [13,16,21-24] The starting point for constructing a model within 
this mathematical framework is the commonly used strategy of describing a phe- 
nomenon with a set of diflFerential equations, one for each dependent variable 
in the form of a balance or node equation: 

dXi/dt = Xi = - V-,X^{Q) = (1) 

The Xi{i = l,2,...,n) represent genes, metabolites, or other components 
characterizing the dynamics of the system. and V~ are collections of all pro- 
cesses that directly lead to an increase or a decrease in respectively. They 
may be multivariate and depend on none, some, or all Xi,...,X,^. They may 
also include independent variables, , Xn^rn, which code for exogenous 

substrates, enzyme activities, or modulators that are constant during any given 
analysis. The crucial concept of BST is a representation of and V~ in terms 
of power- functions, which follows rigorous, well-established methods of numerical 
analysis. In a nutshell, the functions and variables are represented in logarith- 
mic coordinates, where they are approximated by multivariate Taylor series, of 
which only the constant and linear terms are retained. The linearized functions 
are subsequently translated back into Cartesian coordinates. The result, 

n+m n+m 

v+=a,\{xf\ V-=(d,Y{x';- ( 2 ) 

j=\ j=\ 

is a representation of and V~ as products of power-law functions in the de- 
pendent and independent variables. The non-negative multipliers ai and j3i are 
rate constants that characterize the flux rates between pools or variables. The 
real-valued exponents gij and hij are kinetic orders. Their magnitudes reflect the 
strengths of the effects that the corresponding variables Xj have on a given flux 
term Vi. A large positive value signifies a strong activating or augmenting effect 
exerted by Xj, a negative value signifies inhibition, and a value close to zero indi- 
cates that Xj is not very influential in the given flux term. Only those variables 
are included that directly affect the functions and V~ . All other variables have 
exponents of zero, which in effect eliminates them from the corresponding terms. 

At one value of choice, the operating pointy the representations in (Eq. 2) 
are exactly equivalent with the original functions and V~ , if the rate con- 
stants and kinetic orders are chosen appropriately. Close to the operating point, 
the representations are guaranteed by Taylors theory to be very accurate. Ample 
experience has demonstrated that the range of valid representation is often wide, 
sometimes spanning several orders of magnitude in the values of the dependent 
and independent variables. 
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If the true functions and are unknown, the numerical values of the parameters 
in the power-law representation (Eq. 2) are not known. Nevertheless, the struc- 
ture of the equations is completely predictable and can be formulated symbolically 
from information about which variables directly affect and V~ . This type of 
information is often available for metabolic pathways and genetic networks. Nu- 
merical implementations require the identification of parameter values; numerous 
methods have been developed for this purpose. 

If the power-law approximations (Eq. 2) are substituted in the general system 
description (Eq. 1), the result is a highly structured model known as an S-system: 

n-\-m n-\-m 

X, = ai\{xf^ (3) 

j=l 3=1 

As an alternative to the S-system formulation, the functions and V~ in 
Eq. (1) may be split up in several terms, which are each approximated as products 
of power-law functions. This strategy yields a so-called Generalized Mass Action 
( GMA ) system. The conversion of a GMA into an S-system model is a straight- 
forward mathematical step that does not require additional data or analyses. The 
advantages and drawbacks of S-systems versus GMA systems have been discussed 
widely in the literature. Common to both implementations is that power-law ap- 
proximations are usually more accurate than linear models and show a much richer 
repertoire of possible responses that include synergisms, limit cycles, and chaos. 
Furthermore, both types of systems have a format that is highly structured and 
therefore amenable to efficient algebraic analyses and to computational evaluations 
with customized software that readily produces steady states, stability, sensitivi- 
ties, robustness, and dynamical features. 

S-systems are particularly useful for the comparative analysis of alternative 
system designs and their operation, for two reasons. First, the steady-state equa- 
tions of S-systems are linear, when all variables are logarithmically transformed. 
This linearity permits algebraic analyses of critical features associated with the 
steady state, including parameter sensitivities, gains, robustness, and local sta- 
bility. In most cases, these analyses are executed by computer, but the fact that 
algebraic solutions exist permits highly efficient numerical evaluations that would 
not be available otherwise. For instance, the computation of parameter sensitivi- 
ties is largely a matter of linear algebra, which is ideally suited for computational 
assessments, whereas sensitivities in models consisting of Michaelis-Menten rate 
laws require iterative algorithms that very drastically slow down the analysis (for 
a discussion see [18]). 

Second, S-systems are ideal for comparative analyses, because each parameter 
and each term has a uniquely defined meaning. For instance, the strength of the 
inhibitory effect of X 2 on the process vu in Fig. 3 is represented by a kinetic 
order with the symbolic name ^ 22 - Since ^22 represents an inhibition, it is known 
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immediately that it carries a negative value. If ^22 is close to zero, the inhibition 
is weak, and with increasing magnitude the strength of inhibition grows. 

These features of S-systems have led to the method of controlled mathematical 
comparisons (MCMC) (e.g., [6,14]), which forms the basis for many analyses of 
design and operation. As a typical generic example consider inhibition by product 
Xn, exerted upon the synthesis of the lead metabolite Xi. Suppose the synthesis 
of Xi is represented by the power-law term Xj^^ X^^^ X^^"^ , where Xq is 

the substrate for the process, and Xj and Xk are two modulators. The inhibition 
is represented by the kinetic order gin, which is a real number with negative value, 
indicating inhibition rather than activation. According to MCMC, a system A with 
inhibition is compared to an otherwise equivalent system B without inhibition in 
the following manner. First, the latter system B is characterized by gin = 0. 
Since then = 1 , this setting reflects that A^ does not affect the 

synthesis of Ai. Second, the two systems should be equivalent to the outside 
observer. For instance, their steady states, which are typically observed before 
the internal structure of the system is known, should be the same and of the 
value experimentally determined. To enforce this external equivalence, the term 
for synthesis of Ai in systems A and B should have the same numerical value. 
The difference in g\n is accounted for by adjusting a\ in system B, such that 

at the operating point. With 
the adjustments in g\n and a\, the systems behave exactly the same, for instance 
with respect to alterations in Xj or A^-, as long as A^, is not involved. At the 
same time, a comparison of their stability, sensitivities, robustness, and dynamic 
responses reveals the speciflc and unique effect that the inhibition by A„ has on the 
pathway. In this fashion, alternative pathway with or with particular regulation, 
such as in Fig. 3, have been compared, and these comparisons have shed light on 
speciflc advantages and disadvantages of observed and hypothetical systems and 
their operation. 



5. Discussion 

The dawn of the post-genomic era poses a tantalizing question: How can the enor- 
mous amount of genomic information be interpreted to yield true comprehension 
of the functioning of organisms? Several genomes are essentially sequenced in their 
entirety, and many more will follow within the immediate future. Yet, many facets 
of their regulation and metabolic interactions remain unclear. While there is enor- 
mous diversity among organisms and their responses to stimuli, it is to be expected 
that there are also general principles governing optimal functioning. Principles of 
one type, which have been studied for three decades, address the structure of meta- 
bolic pathways and gene circuits. They were called design principles and explain 
why pathways are composed and regulated in a particular fashion rather than in 
an alternative, theoretically possible fashion. They provide objective reason for 
why some genes are regulated through repression and others through induction. 
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Here I suggest that there are additional principles to be investigated, which may 
be called operating principles. These govern the dynamic responses to particular 
stimuli and complement the structural design principles in the same fashion that 
a user’s guide complements the hardwiring of an electronic appliance. The design 
principles, which determine the structure of a pathway or gene circuit, define the 
boundaries of possible responses of the organism, while the operating principles 
provide specific guidelines in response to biological demands that follow an ex- 
ternal stimulus. Operating principles are well known from other facets of life. If 
the design of a sonata is recorded as sheet music, the operating principles govern 
the actual performance of the sonata. Even if the body of law determines what is 
legal and what is not, courts and their officers are still needed to interpret the law 
in a given situation. It is worth noting that there is not necessarily a correlation 
between the simplicity or complexity of design principles and operating principles. 
A hammer has a simple design and it is also rather easy to operate, and a super- 
computer has a complex design and it takes a real expert to operate it. However, 
there are simple designs that require complex strategies of operation; examples are 
the game of chess and the violin. Conversely, machines such as cars and household 
appliances are designed in a complex manner with the specific intent of rendering 
their operation as simple as possible. How the complexity of design and operation 
relates in natural systems remains to be explored. One may surmise that operat- 
ing principles are indeed meta- design principles^ i.e., mere consequences of design 
implementations at different levels of organization. Even if this is the case, it is 
beneficial to investigate operating principles separately, because modern experi- 
mental techniques of biochemistry and genomics make operating principles almost 
directly observable, even if the molecular implementation of their design is far 
from understood. For instance, DNA microarrays and metabolic profiles permit 
the establishment of time trends in gene expression or metabolite levels following 
a stimulus, even though the intricacies of the governing gene regulatory networks 
and metabolic interactions are complex and far from evident. Thus, important 
operational features of the response can be measured, even though it may not be 
known in mechanistic detail how the specific responses are brought about. 
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Living cells are highly integrated dynamic systems composed of soft materi- 
als and a large variety of regulatory proteins. The internal architecture is by no 
means static, but highly dynamic allowing for directed responses to external and 
internal stimuli. This is a crucial design element for many cellular phenomena. 
Intensive research over the last decade has provided evidence that cooperativity 
and fluctuation phenomena play an important role in cell biology, in particular on 
the level of the very fundamental processes such a^ cytoskeletal organization, cell 
motility, and force generation by molecular motors. The reviews in this chapter 
span the range from molecular approaches focusing on the mechanical and dynam- 
ical properties of individual proteins to a more system-based view of whole cells 
and even cell complexes. 

Numerous experiments in vivo and in vitro have shown that the structural 
element responsible for the extraordinary mechanical and dynamical properties of 
eukaryotic cells is the cytoskeleton, a three-dimensional assembly of protein flbers 
such as act in filaments and microtubules. In order to understand the complex- 
ity of these composite soft materials we need studies focusing on the static and 
dynamic properties of individual filaments, as well as experimental investigations 
and modeling approaches of whole cells. 

The review of Wiggins and LeGoff on “Biopolymer Dynamics” shows how 
the physical properties of biopolymers are genuinely different from their synthetic 
counterparts such as polyethylene. In many instances a free polymer is already 
well characterized by two lengths, the total contour length and the persistence 
length (which is the mean correlation length of the tangent vectors). When the 
persistence length is much smaller than the total length, the polymer is said to be 
flexible and its conformations can be treated as a random walk. This is typically 
the C£Lse for synthetic polymers. When the two lengths are of the same order, the 
polymer is said to be semiflexible. Some of the most important biopolymers belong 
to the latter class. In particular, the structural elements of the cytoskeleton such as 
microtubules, actin filaments, and intermediate filaments have persistence lengths 
of the order of 6mm, 17//m, and 2/xm respectively. Wiggins and LeGoff describe 
how a quite unique combination of experimental and theoretical studies have led 
to a unified picture of the conformations and dynamics of biopolymers. 
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Molecular motors are another key element for understanding cellular pro- 
cesses. These nano-sized engines are a highly specialized class of enzymes which 
are able to transduce the energy excess in the chemical hydrolysis reaction of ATP 
(adenosine-triphosphate) into mechanical work. They can be classified into several 
different families (myosins, kinesins, dyneins are the most numerous ones) and are 
involved in a large variety of important cellular processes ranging from the trans- 
port of material and vesicles inside or outside the cell, to cell motility, cell division 
and RNA transcription. Parmeggiani and Schmidt give a well rounded overview of 
recent developments in this fascinating field. They try to find a common ground 
in theoretical and experimental description of the dynamics of motor proteins. A 
quite substantial emphasis is put on experimental methods and the emerging dif- 
ficulties, as well as conceptual difficulties of a theoretical approach which needs 
to account for a substantial number of degrees of freedom, being spatial as well 
as chemical. The key problem of the field seems to be to find the “right” level of 
coarse graining and to correctly link the various length and time scales. 

At present, simplified models with a small number of states are already quite 
successful in describing the effect of fluctuations on various mechanical properties 
of the motors as well as its efficiency. This contribution concludes with an outlook 
on future challenges, which deary will be to understand cooperativity in intracellu- 
lar transport. Some of the questions one may like to ask are as follows. What kinds 
of collective phenomena and cooperativity are possible in intracellular transport? 
Do non-equilibrium phase transitions play a significant role in regulating modes 
of transport? 

The contribution by Pompe et al. exemplifies the role of a specific protein 
(osteocalcin) as regulator of bone cell adhesion to “biomimetic” collagen surfaces, 
which are designed to be used as raw materials for bone replacement. Here the 
detailed and picturesque description of experimental assays gives insight into the 
geometry and dynamics of processes on both scaling levels, namely, how osteocalcin 
changes the materials surface properties on a spatial scale below iim and, how this 
alters adhesion and spreading of bone cells (osteoblasts) on such substrates, on 
a scale of more than 20 jj.m. The presentation concludes with an outlook to a 
potential description and analysis of the whole remodelling process of artificial 
bone implants. 

Spatz and coworkers focus on the mechanical response of whole cells when 
placed on micro-structured substrates. Such investigations are versatile model sys- 
tems for studying the mechanical effects on cell shape and cellular regulation. Of 
course, theoretical modelling of such complex systems based on a molecular ap- 
proach seems (yet) to be out of reach. Instead, alternative and more phenomenolog- 
ical concepts are needed. Cell models usually have to focus on a few characteristic 
features (shape parameters) which are described mathematically in terms of sto- 
chcLstic differential equations. In their review they exemplify how such a modeling 
approach can be used to quantify the deterministic and stochastic elements in the 
regulation of cellular response of melanocytes to mechanical signals. 
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To quantify cooperativity at the level of cell cultures is a challenging re- 
search field of enormous complexity. One has to understand how the activation of 
intracellular signal pathways through some external stimulus is transduced into a 
dynamic reorganization of the cytoskeleton and intercellular (or basal) junctions 
leading, in response, to a rearrangement of cell shape and motility within the cell 
culture. In the last contribution of this chapter, Dieterich and coworkers exemplify 
this by investigating shear stress induced cell migration in confluent monolayers 
of endothelial cells. Like at the inner surface of blood vessels, uniform or alternat- 
ing fluid flow applied in an in vitro flow chamber system (rheometer), induces 2-D 
orientation and polarization of cells with respect to the given flow direction. Quan- 
tification of this adaptive response is achieved by suitable image processing and 
capture of individual cell paths as well as of their projected area and perimeter. 
Subsequent extraction of characteristic indices (as velocity, orientation mean and 
variance, or shape index) reveal typical time courses of these quantities plotted 
over time after stimulation, with interesting correlations, also to corresponding 
data of the transendothelial electric resistance (TER) quantifying the averaged 
size of intercellular spacing within the confluent monolayer. Finally, these statis- 
tical evaluations are taken as a strong hint to explain the observed correlations 
between individual cell motility and tissue deformation by using individual-based 
multicellular simulation models. In particular, the authors suggest suitable gen- 
eralizations of the well known Voronoi tesselation method, which would capture 
the intercellular forces transduced via cytoskeletal filaments and specific contact 
proteins, as well as their regulation in response to outer stress stimulations. 
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1. Introduction 

Recently, many studies of the effect of rigidity on polymer dynamics have been 
carried out on biological polymers, such as DNA, actin, or microtubules. In the 
context of this chapter focused on cell mechanics, we will restrain our discussion to 
actin polymers. Actin filaments for which the persistence length is of the same order 
as the contour length (recent measurements converge towards a value around Ip ~ 
15/xm for a phalloidin stabilized filament, and around Ip ~ 10//m for an unstabilized 
filament) correspond typically to the regime of semi-flexible polymers. The physics 
of actin filaments as semi-flexible polymers has therefore been intensely studied 
over the last years. Here the motivation is both to use them as model semi-flexible 
polymers (taking advantage of their ease of visualization and manipulation), and 
to set microscopic basis for the understanding of cell mechanics. We will first go 
through the various techniques that can be used to study these chains. 

1.1. Electron Microscopy 

Electron microscopy allows us to see individual filaments and to resolve their finest 
details (such as their helicity). Recent low resolution developments also included 
the possibility to reconstruct three-dimensional structure by averaging many 2D 
images [1]. Although it has been used for ela^sticity measurements, electron mi- 
croscopy is not ideal however for physical investigations, because it requires the 
prior immobilization of the filaments by sticking them on a grid or by freezing. 

1.2. Optical Microscopy 

Optical microscopy offers the possibility to observe, with a limited spatial resolu- 
tion ( 300 nm), the time dependent semi-flexible dynamics of individual filaments. 
Actin filaments are proteinous filaments with a diameter of Snm that do not scat- 
ter light sufficiently to be directly observable by optical microscopy. It is however 
possible to label them with fluorescent probes, or to complex them with myosin 
and tropomyosin so as to make them diffuse light more efficiently (they can then 
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be observed by dark-field microscopy) [2]. Both of these techniques are quite in- 
vasive, this is why a method was recently proposed to label specifically the ends 
of the filaments instead of the whole filament by annealing at the extremities of 
short fluorescent filaments [3]. 

1.3. Light Scattering 

Light scattering has been used to study the dynamics of actin solutions for more 
than 20 years. Early attempts interpreted the dynamic structure factor in terms 
of sums of one or two single exponential decays reflecting the translational and 
rotational motion of rodlike chain solutions. As we will see below, taking into 
account the bending modes of the chains involves a reinterpretation of the structure 
factor in terms of stretched exponential. 

1.4. Colloidal Tracers 

We denote by colloidal tracers the use of microspheres to probe the dynamics of 
filaments. One can distinguish two distinct regimes. On the first end, when the 
beads are much smaller than the mesh of the actin solution, and when they are 
stuck on single filaments, one probes directly the fluctuations of the filaments [4]. 
On the other end, when the beads are much bigger than the mesh of polymer 
solution, the thermal motion of the beads is related to the viscoelastic complex 
modulus of the solution provided that the local response of the bead to a thermal 
stress relates the macroscopic response to a shear (mean field approximation). 
Since the viscoelastic modulus at high frequencies is dominated by single polymer 
dynamics, this micro-rheology technique is also a way to probe the dynamics of 
filaments [5]. 



2. Elasticity 

The basic energetic premise of semiflexible biopolymers is that they may be de- 
scribed as a homogeneous, isotropic elastic rod, sometimes called the “worm-like 
chain.” This is characterized by an energy which is a functional of the shape R(s, t) 
of the polymer: 









dsRi 



( 1 ) 



where subscripts indicate differentiation with respect to the arclength s, defined 
such that (R.)^ = 1. The material parameter k has units of energy X length and 
may be re-expressed in terms of the Young’s modulus Y and the moment of inertia 
/ or in terms of and the persistence length Lp\ 



K = YI = ksTLp 



( 2 ) 
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From this energy one may derive a force per unit length by functional differentia- 
tion: (keeping boundary terms): 

£ rp pL r-p 

f = = dsR,,-^ = + . (3) 

The boundary terms imply vanishing torque (normal to the curve) and force at 
the edges for a free polymer. 

The persistence length of filaments can be measured by a statistical analysis of 
the contour fluctuations by image analysis. The analysis can either be performed in 
real or reciprocal space [6-8]. Both methods agree on their estimation of Ip ~ 16//m. 
it seems however that mode analysis is quite sensitive to various kinds of systematic 
problems in data analysis. 



3. Hydrodynamics 

To derive a dynamic from this force per length, we consider the effect of the 
force per unit length on the fluid in which it is embedded. For polymers with 
length scales of order microns and motions on time scales of seconds (as, for 
example, in micromanipulation experiments), the Reynolds number in water is 
Re = UL/iy = = (lO”'^)"^ • I/IO""^ = 10“^’. We are thus safely in the Stokes 

regime, and the relevant fluid equation is 

0 = /iV“u - VP -f fpoly„H«r^'2(x - R(s)), (4) 

V • u = 0 (5) 

where the delta function in the first equation indicates that the polymer only exerts 
force at the location of the polymer. The subscript “2” on the delta function is a 
reminder that the the force is distributed on a line and therefore the delta function 
has units of length^^“'^^ =length“'^. The second equation, a constraint equation, 
determines the pressure P. A careful treatment of the problem [9] requires inverting 
the Laplacian, subject to incompressibility, and convolving the result against the 
force to solve for the fluid velocity. One then evaluates the velocity on the curve 
itself to derive the dynamic for The asymptotic result for slender rods is 

C(1 - ^R.R.,)R, = f (6) 

Where ( — Anfi/ ln(L/a) for a polymer of length L and diameter a. The denomina- 
tor is frequently referred to as “the logarithmic correction”, meaning that, had one 
pictured the polymer as a set of noninteracting units, one expects a drag constant 
proportional to /i alone. 

The projection tensor on the left hand side manifests the intuitive result that 
it is harder to drag a rod through molgisses normal to its axis than it is parallel to 
the axis (it turns out, in fact, that it is asymptotically twice as hard as the rod 
becomes of vanishing aspect ratio). For most problems, answers are changed only 
qualitatively by taking the drag to be isotropic, and we will do so below. However, 
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if one is interested in flagellar hydrodynamics, one must retain this anisotropy to 
derive any propulsion. 

4. Bending 

One may then investigate the behavior of a weakly-bent rod, i.e., one for which 
L/Lp = LksT / K <C 1. In this case, one can expand R (sz -f R||, Rj^) about an 
axis z and And 

C(B.j_)i = — /^(Rj_)ssss -h ^ (7) 

where ^ describes thermal noise. A surprising amount of utility can be gained 
from this equation. One may apply it in the absence of noise for micromanipula- 
tion studies of single biopolymers [10], use it in the presence of noise to describe 
the spectrum of fluctuations in a gel [11], or derive the perpendicular fluctuation 
correlation functions [5,12,13]. 

In fact, the deflnition of arclength allows one to derive as well the correlation 
function for longitudinal functions, since 

1 = R2 = (1 + (R||).,)2 + ((R^),)2 (8) 

gives us approximately (R||)s ~ Thus the transverse fluctuations de- 

termine the longitudinal fluctuations. 



5. Twisting 



The dynamic above fails to describe the twisting of a biopolymer. One may intro- 
duce a new quadratic energy to describe this physics. 




but this only has meaning upon deflning a material frame, described by ei, which 
may be thought of pointing from the center of the rod to a painted line on the 
filament which is straight when in mechanical equilibrium, and 02 = R^ x ei . Then 
== esdsBi and R^^ can be re-expressed as -h defined cyclically. 

Functional differentiation of the sum + F^twist gives a force and a new 

term, a torque, which is the term in the functional derivative conjugate to e 2 • , 

which describes variations of this frame which correspond to twisting about the 



axis. 



To close the dynamic, we must invoke a small amount of geometry to relate 
gradients in the term which will oppose rotational viscous drag, 83 ^ 281 ^ 1 , with the 
dynamics of the twist, 8 te 28 sei. The result is a closed system for the dynamics of 
bend and the dynamics of twist. For small deformations, we may write this as 



C8tK± = -k8^R± + z X C8s^3dsR± H (10) 

dt^3 = + Rs • R^S X Rsi (11) 

Got 
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where the second equation is exact. 

Kamien pointed out that this may be interpreted as a local statement of the 
topological conservation law Lk — Tw -\-Wr [14]. Tabor pointed out that the last 
term may be interpreted as a geometric phase [15]. Goldstein et al. [16] investigated 
the full, nonlinear dynamic, including the case when the ability to spin about the 
axis is fully suppressed (e.g., due to constraints arising from the structure of the 
rod), in which the diffusion- like term does not appear. 



6 . Stretching 



For a homogeneous, isotropic rod, the modulus for stretching is quite high. In 
general, it is related to the bending modulus by the square of the inverse diameter 
of the rod. For DNA, for example, the bending modulus is ksTLp with Lp ^ 50nm, 
whereas the stretch modulus is 7 = ksT/Lg where Lg ~ 1500/m. The geometric 
mean -yjLpLg ^ 10“*^m is of the order of DNA’s diameter. 

As a consequence, the time scale for stretching is far faister than the time 
scale for bending. Being interested in this slower (bending) dynamic, one typically 
uses the same mechanism used in fluid dynamics. Being interested in low-Mach 
number dynamics, we take the medium to be purely incompressible. In high di- 
mensions this gives rise to the pressure P, a Lagrange multiplier field determined 
by the constraint equation V • u = 0. Here, we have a one-dimensional problem, 
and the Lagrange multiplier field is a tension A, ^ and the constraint equation is 
R'^ = 1 , or, differentiated with respect to time, the one-dimensional analogue of 
incompressibility, R.^ • S^R^ = 0 . 

This then introduces a new energy and force per unit length: 



Estvv.tch 

/stretch 



9sARs 



(12) 

(13) 



which are inserted into the equations of motion and solved for using the constraint 
equation. 



7. Out-of-Equilibrium Dynamics 



In the presence of finite temperature, modeled via stochastic forcing, we have the 
full equations (for the moment ignoring hydrodynamic anisotropy) 



= 

{dsRf = 

Us, (0,0)) = 



1 

2kBTCS,jS{s)d{t) 



(14) 

(15) 

(16) 



^Avoiding r to avoid confusion with time or temperature and 7 to avoid confusion with shear 
rate. 
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equations which enforce force balance, inextensibility, and the fluctuation-dissipa- 
tion theorem, respectively. 



7.1. Nondimensionalization and Scaling 

It is reasonable at this point to choose a typical length scale, energy scale, and 
time scale, so that the variables in the problem are 0{1). We begin by choosing 
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R = -^qR 
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t = ^0^ 
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and dividing the equation of 
Dropping the tildes, we have 


motion, with units of force per length, by 
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It is reasonable at this point 
k^ThilLl]. We then have 


to choose ^0 = CL‘o/k and (^y = kuTC^/L 


0^0 = 


dtK 


= a;'R + a,Aa,R + . 

K y Lp 


(25) 


{d,Rf 


= 1 


(26) 




= 26,,8{s)8{t) 


(27) 



Note that we have yet to choose a characteristic tension Ao nor a characteristic- 
length L(). 

7.2. Infinite Polymer: Lo = Lp; Ao = ki^T/Lp 

For an infinite polymer, we must choose Lo = Lp, the only length scale in the 
problem, and Ao = ksT/Lp^ the only energy scale in the problem, to find 

dfjl = + 9.sA9sR + ^ 

(a,R)2 = 1 

(6(s,i)^,(0,0)) = 26,jS{s)S{t) 



(28) 

(29) 

(30) 
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7.3. Pulled, Finite Polymer: Lq = L; Aq = Fq = 

One sometimes wishes to describe (9(1) pulling, in which the origin of tension 
is not the thermal driving.^ We then introduce and, if one chooses 

A„ = Fo = !kIL\ one then finds 



dtR = -dtR + fdsAdsR + e^ 


(31) 


(5.R)^ = 1 


(32) 


s,t)e,(0,0)) = 2S,jS{s)S{t) 


(33) 



7.4. Free, Finite Polymer: Lq = L; Aq = ksT/Lp 

The case of general interest for semifiexible biopolymers is L < Lp which are free, 
so that the origin of the tension is the thermal driving. We then have 



3tR = (34) 

(5«R)' = 1 (35) 

Ms, {0,0)) = 25,j8{s)6{t) (36) 

To lowest order, then, let us begin by dropping terms of 0{e'^), and parameterizing 
the curve as R = (s — er). The transverse component obeys 

d,r = -ay + C (37) 

dir,, = + (38) 

r„ - e-"'' f (39) 

J —OO 

{r,,{t)rk{0)) = S{q + k)q~'e~‘>'' (40) 



in accord with the equipartition theorem resulting from the first non-vanishing 
term in the expansion of the Hamiltonian, and inextensibility gives 



\-2e^p' + e^r'^ = I + 0{e^) 



(41) 

(42) 



The first of these gives the dynamic correlation for transverse displacements. 



((Ri(s,t)-Rx(0,0))'^) 



((r(s,t)-r(0,0))^) 
f dql- 

J 2n 



t-*/’F(st‘/'^) 



(43) 

(44) 

(45) 



with F(0) w .38. Longitudinal displacements may be computed from Eqs. (8), and 
using Wick contraction on the expectations of r^. The result has the same scaling 
in time [12, 13]. 



■^This may be thought of as a boundary condition in the constraint equation for tension, not 
unlike a boundary condition on the stress (and thus the pressure) when one boundary of a fluid 
is subjected to external forces. 
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A recent calculation [17] seems to show that the first correction, however, 
scales as . We remind the reader that the above calculations are predicated 
on short filaments (L Lp) in the absence of externally applied tension (Aq = 
^bT / Lp). 

The subdiffusive behaviour of Eq. (45) was first pointed out in order to inter- 
pret dynamic light scattering experiments [18,19]. In analogy with scattering from 
flexible polymer solutions, the dynamic structure factor of a semi-fiexible polymer 
solution obeys to a stretched exponential behaviour oc e~^ ^ 

The dynamics of semi-fiexible polymers can also be measured more directly 
by probing the random motion of colloidal tracers embedded in an actin solution. 
The colloids present a subdiffusive behaviour with a (SR‘^{t)) oc power for the 
mean square displacement [11]. Other measurements also based on colloidal trac- 
ers, inferred the same power law behaviour for the direct transverse fluctuations 
of individual filaments in a semi-dilute solution [4] or the visco-elastic modulus 
probed by micro-rheology [5]. 

Eventually, the full spectrum of the dynamics of individual actin chains, 
including both the cx power law at short times and the saturation towards 
equilibrium at longer times, has been measured by single filament assay [3]. It was 
shown that the dynamics of end-to-end distance can be described by a single scaling 
law for filaments of all lengths. This measurement, presented in Fig. 1 together 
with the radial distribution function (RDF) of Fig. 2, which was obtained from the 
same experiments, represents the first joint estimate of both statics and dynamics 
of semi-fiexible polymer fluctuations. In order to quantitatively account for the 
end-to-end fluctuation, it is necessary to introduce incompressibility of the chain. 



8, Statistical Mechanics and Inextensibility 

A number of experimentally-relevant statistical quantities can be calculated from 
the fluctuation spectrum of r^: 



8.1. Force-Extension Relation 



The spectrum of fluctuations may be derived in the presence of an 0{l) applied 
force from Eq. (31): 



(|r„l^) = 5{q + k)-^ 
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(46) 



from which one may derive the force-distance relationship for longitudinal exten- 
sion (in the high-force limit) via inextensibility. 
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Figure 1. Rescaled dynamics for end-to-end fluctuations of actin 
filaments. The y-axis represents the mean square displacement of 
end-to-end distance for 8 filaments, renormalized by their plateau 
value F={ml)SR^{L,t)/L\ The x-axis represent a rescaled time 
t/r\, where t\ is the cross-over time for the dynamics t\ cx 
The solid line is the prediction for a weakly bending rod in two 
dimensions. Inset is the raw data before rescaling. Taken from [3] 



In the low- force limit, the sum is sometimes approximated by 






X{0) + fX\0) 




(50) 

(51) 

(52) 

(53) 



Remember that this expression is valid for small L/Lp and small Ft? Jk. 

In the high- force limit, we may replace the sum with an integral to find 






(54) 



8.2. Radial Distribution Function 

In the absence of external tension, one may use the spectrum found in Eq. (40) 
(at equal time), along with inextensibility, to derive the probability distribution 
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r{Mm) 



Figure 2. Measured radial distribution function (bars), and the- 
oretical expression of Eqn 60(line) with ip = 16.1 ± 0.2/im and 
Lc — 13.40 zb 0.01/im. Figure taken from [3] 



function for a given projected end-end length, 
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where is a normalization constant. In dimensionful form, this is well approxi- 
mated by 



where 



X > 0.2 




(60) 



(61) 



and is plotted in Fig. 2. 

The conformations of a chain are measured most efficiently by video mi- 
croscopy on single filaments. Moments of the radial distribution function (i?^) 
have long been measured in order to estimate the persistence length of actin fil- 
aments [2,20]. However only the measurement of the full RDF can assess the 
semi- flexible nature of actin filaments. This was done recently, when an accurate 
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measurement of the end-to-end distance with the help of end-tags allowed a quanti- 
tative comparison of an experimental radial distribution function and a prediction 
in the context of the wormlike chain model [3] (see Fig. 2). 
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1. Introduction 

Motor proteins are examples of machines working at the molecular, i.e. nanometer 
scale. How exactly such machines function in a noisy environment has been a mat- 
ter of a debate since the discovery of the myosin cross-bridge in muscle sarcomeres 
by A.F. Huxley [31]. 

New experimental techniques have given access to both, the microscopic structures 
and the dynamics of motor proteins, beginning to clarify their role in a multitude 
of phenomena ranging from bacterial motility, via muscle contraction and ciliar 
beating to DNA replication and transcription [1, 5]. 

Research hcus been progressing on a wide front, ranging from (atomic) structure de- 
termination by X-ray crystallography and electron microscopy, via the detection of 
chemical pathways with the techniques of enzymology, to muscle fiber experiments 
on large ensembles of motors and to single-molecule measurements of molecular 
motions and forces. In parallel to experiments, models have been proposed, both, 
specific ones focused on a particular system and generalized universal ones. The 
wide variety of scientific disciplines, including biology, physiology, biochemistry 
and physics which has been involved in this research, has sometimes caused com- 
munication problems. The tradition in the life sciences is to explore specific systems 
in all detail, whereas the goal in physics often is to abstract from the details and 
search for universal principles. It is still not clear in how far it will be possible to 
describe the dynamics of motor proteins by universal principles that go beyond the 
level of mere generalities. It has, however, been shown that stochastic motion of 
molecules driven by chemical non-equilibrium even in the simplest models can give 
rise to unexpected, non- trivial effects, which may very well occur in real protein 
systems. 

In this short article, we will, without striving for completeness, focus on a few cen- 
tral topics that have been addressed in theoretical models, trying to clarify some 
concepts that are crucial in connecting simplified models to reality. 
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2. What are Motor Proteins? 

As far as we will talk about specific proteins, we will restrict the discussion to 
cytoskeletal motors. The main reason for this restriction is that these are the sim- 
plest motors in structure, much less complex than DNA and RNA manipulating 
enzymes, the fiagellar motor of certain bacteria, or the ATP producing FoFi- 
ATPsynthase complex. Cytoskelatal motors are classified by sequence similarity 
into three superfamilies, the myosins, kinesins and dyneins [26]. These motors are 
involved in phenomena such as mitosis and meiosis, intracellular transport, move- 
ment and growth of eukaryotic cells, as well as in muscle contraction, ciliar and 
flagellar beating, and in sound detection by hair cells. 

Motor proteins are characterized by their ability to convert chemical free en- 
ergy into mechanical work by cyclically binding and hydrolyzing the nucleotide 
adenosine-triphosphate {ATP) into adenosine-diphosphate (ADP) and inorganic 
phosphate (Pi). This chemical reaction provides the free energy that drives at 
least two conformational changes of the protein and at the same time switches 
the affinity to the cytoskeletal semiflexible filaments serving as tracks, actin for 
myosins and microtubules for kinesins and dyneins ( ‘Tymn- Taylor mechanisms” 
[51]). One conformational change, usually called “power-stroke”, occurs when the 
motor is strongly bound to the filament, producing force or motion, the other one, 
called “recovery-stroke” , occurs while the motor is detached from the track. Thus 
a chemical cycle is coupled to a mechanical cycle. 

Different realizations of the cycles and of the mechanochernical coupling can ex- 
plain the different dynamical behaviors of motor proteins and their roles in spe- 
cific biological systems. Well studied examples for the range of possibilities are the 
neuronal transporter (conventional) kinesin, and the muscle motor myosin II [26]. 
Conventional kinesin, powering axonal transport of vesicles towards the synapse 
in neurons, is a tetramer of two identical heavy chains and two light chains. It can 
stay attached to its track over about 100 ATP hydrolysis cycles, a property called 
“processivity” . Processivity increases the efficiency of transport when driven by 
a small number of motors, but it limits the speed of transport to the chemical 
turnover rate of a single motor multiplied with the step produced per ATP hy- 
drolyzed (about 1 /i,m/s for kinesin). Myosin II, on the other hand, is characterized 
by a very short (compared to the whole cycle time) strong interaction with the 
actin filament, during which it performs a power-stroke. A single motor will there- 
fore not stay attached to the track after a power stroke, making it a non-processive 
motor. Myosin II is optimized to work in large assemblies such as muscles h 
Processivity is not a feature specific to all the motors belonging to the same su- 
perfamily. In recent years. Myosin V has been found to be processive [55], while 
the kinesin-related ncd is not [2, 83]. There is an intermediate scenario, motors 
that can not produce connected steps, because they for example have only one 
head, but still stay associated with their track over longer periods of time due to 

^The short interaction time increases the maximal shortening speed of muscle by cooperative 
displacement generation (~ 6 — 10 jjim/s) [26]. 
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non-specific attractive interactions. Such behavior wais found for a single headed 
motor, a truncated construct of the KIFIA kinesin [61]. This motor can produce 
a directed motion along the microtubule, alternating forward steps with diffusive 
phases along the track. The reason why this motor stays attached to the micro- 
tubule is probably that it has a strongly positively charged loop (the K-loop) that 
interacts non-specifically with the negatively charged microtubule. This motor is 
not mechanically processive and does not produce motion under load, and there is 
evidence that in its physiological environment (at higher concentration) it forms a 
processive dimer [88] . The interaction with the relatively rigid and polar cytoskele- 
tal filaments allows the ’’linear” cytoskeletal motors^ to drive directed motions or 
apply directed forces inside the cell. The longitudinal axis of the filament fixes the 
axis of motion while the filament polarity fixes the direction. Most myosins move 
towards the so-called “barbed end” (feist growing end) of actin, and most kinesins 
move towards the so-called plus end of the microtubule, while dyneins move in the 
opposite direction'^ 

3. Structural Properties of Motors and Filaments 

In recent years, on the order of a hundred different motor proteins have been 
identified in each of the superfamilies within and across organisms mostly by ge- 
netics [57, 80, 35]. Structural techniques like X-ray crystallography and electron 
microscopy have produced data on a number of motors, on actin filaments and 
microtubules and on the interaction of motors with their tracks (“decoration ex- 
periments” [53, 25] )T 

Myosins and kinesins have striking similarities in the atomic structure of their 
enzymatic cores although their amino acid sequences are very different. They are 
believed to have a common ancestor, also shared by G-proteins^ enzymes involved 
in cellular signalling [36, 43]. Dyneins are structurally distinct. 

Myosin II and conventional kinesin both form a dimer of identical heavy chains 
with cissociated light chains. They dimerize via an alpha-helical coiled coil between 
the “stalk” regions of on the order of 100 nm in length. A roughly globular struc- 
ture of about 10 nm in size, the “head”, at one end of the stalk is the motor 
domain. The head contains the binding site to the filament and the nucleotide 
binding pocket where nucleotide binding, hydrolysis and release take place. In 
myosin II the stalk is also responsible for the polymerization of myosin dimers to 
the functional multi-motor aggregates in muscle, so called “thick filaments” [1]. In 

■^There are also rotary motors such as the FoFi-ATPsynthase or the bacterial flagellar engines, 
both extensively studied in the last years by using single molecule techniques [60, 74]. 

'^The directionality of the motion is also not a conserved property for all the motors belonging 
to the same superfamily: myosin VI [101] as well as the kinesin related ned [54, 100, 22] move in 
the opposite direction with respect of the majority of the proteins of the same superfamily. 

complete classification of all motor proteins is far from being finished. Only a handful of 
structures have been solved and only a small number of motors have been characterized in 
motility assays. 
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Figure 1. Cartoons of the filament and motor structures. (For 
colored picture see color plate 3). 



myosins a relatively stiff alpha helix is directly connected to the catalytic core of 
the motor, functioning as a lever arm amplifying a small conformational change 
near the nucleotide binding pocket to a larger motion of the stalk, or possibly 
storing the ATP energy elastically if the end of the stalk is prevented from mov- 
ing. In conventional kinesin, where the two heads act cooperatively to propel the 
motor processively along the microtubule in 8 nm steps, it is far less well known 
what conformational changes of the heads underlie this progression. It has been 
shown that the regions directly adjoining the head domain, the “neck linker” and 
the “neck” are crucial for processivity and directionality [100, 72, 102, 91]. In the 
non-processive kinesin-related ncd there is an alpha helix in the place of the neck- 
linker, again believed to act as a lever arm [10]. The other important ingredient 
of motor motility is the track filament structure. Both actin filaments and mi- 
crotubules are polar semiflexible protein polymers'". Actin filaments are built as 
a double helix of roughly globular actin monomers with a monomer periodicity 
of about 5.5 nm along one strand of the helix and a helical periodicity of about 
72 nm. Microtubules are hollow cylinders assembled from 13 parallel “protofila- 
ments” each a head-to-tail chain of dimers of aj3 tubulin providing a straight track 
with a periodicity of about 8 nm, see Fig.l. 



4. Motility Assays and Single Molecule Experiments 

We will give here, without striving for completeness, a short overview over some 
of the experimental approaches to motor proteins. While more extensive reviews 
should be consulted for details [16, 87, 91, 102, 20, 26, 70], we will focus on the 



'"^Both filaments are non-equilibrium polymers, hydrolysing nucleotides when polymerizing. 
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most important achievements and also on some of the problems and limitations en- 
countered in these experiments. Muscle, a large, highly organized system of motor 
proteins has been studied for many decades [31, 7]. Even the smallest physiolog- 
ical units commonly studied, single isolated muscle fibers, still consist of billions 
of motors acting together. This makes it hard to deduce the dynamic behavior of 
the individual motors, since strong assumptions have to be made about the ho- 
mogeneity of the system. About 20 years ago, the first so-called in vitro motility 
assays were developed with myosin [81], showing that motor proteins can be made 
to move and exert forces in fairly well controlled in vitro settings with a minimum 
number of ingredients. These assays usually come in one of two geometries (see 
Fig. 2), “surface gliding assays” where the motor is attached to a substrate and the 
track filaments are observed moving across that surface, and “bead assays” , where 
the motors are attached to micron-sized colloidal particles and are observed mov- 
ing along surface-bound track filaments. At first relatively large numbers (100s) of 
motors were observed moving filaments or beads [84, 89, 40]. Soon, however, it was 




Figure 2. Scheme of “surface gliding assays” and “bead assays”. 

shown to be possible to dilute the systems to the point where only a single motor 
was moving a filament [28] or driving a bead [4]. This experiment was relatively 
easy with neuronal kinesin because a single kinesin motor can stay attached to the 
microtubule on which it is moving for on the order of 100 ATP cycles, whereas 
muscle myosin only spends a short fraction (~ 5%) of its mechanochemical cycle 
time in a strongly bound state. Kinesin is therefore a “processive” motor, while 
myosin II is a “non-processive” motor. Surface assays were well suited to study the 
dependence of transport speed on ATP concentration and on motor number . It 
was found that kinesin can move microtubules maximally at about 600 nm/s, while 
myosin can move actin at maximally 10 fim/s. When the motor concentration is 
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lowered, both myosin and kinesin keep moving their tracks at a constant speed, 
demonstrating that at saturating ATP concentrations, the motors are reaction- 
limited, i.e. the viscous load of the filament gliding in water, even though it is 
close to a surface, is not relevant. For kinesin, single- molecule driven movement 
was observed at low motor concentrations, manifested by so-called “nodal-point 
pivoting”. The filaments were rotationally diffusing around a single pivot point, 
the point of motor attachment, while gliding past that point, which is consistent 
with their processive character. Such a behavior was not observed for the non- 
processive myosin: actin filaments would not bind at all anymore to the surface 
below a certain threshold concentration of myosin. Adding a rod-like polymer sus- 
pension (methyl cellulose) to the buffer did have the effect of keeping the actin 
filaments on the ground. Now they could be seen moving at decreasing velocities 
when the surface motor density was further lowered (for an extensive collection of 
specific references to the preceding findings see [26]). 

Substantial efforts were focused on detecting elementary events in the single- 
molecule motility assays. With kinesin it was eventually possible to detect steps 
in the motion records of bead assays of about 8 nm amplitude, occurring at ap- 
parently random times [85]. This was made possible by using optical tweezers to 
manipulate the beads and laser interferometry to measure nm scale motion with 
high temporal resolution [16, 82, 18]. The 8 nm correspond to the spacing of 
equivalent binding sites along an individual microtubule protofilament, and it was 
known from earlier experiments that kinesin follows the protofilament axis [44, 69]. 
Observing 8 nm steps therefore means that the whole kinesin dimer moves for- 
ward by one tubulin dimer in one step, not more and not less, ending up in the 
same physical conformation from which it started. The distance stepped needs to 
be commensurate with the 2D lattice of binding sites, or maybe more strictly the 
ID lattice of a protofilament. Regarding the two (a priori identical) heads in a 
dimer, there are two possibilities for the conformation after one step. Either the 
leading head remains the leading head (“inchworm” model) [29] or the two heads 
exchange roles (“hand-over-hand” model) [27, 77, 75]. The maximal force a single 
kinesin dimer can exert was found to be close to 7 pN [79]. It wets also shown that 
exactly one ATP molecule is needed to perform this step [78]. It is not clear to this 
day what conformational changes and intermediate events take place during this 
stepping process and which of the two possible models apply. Evidence has been 
reported [29] that excludes a symmetric hand-over- hand mechanism, because the 
expected 180 degree rotation of the load could not be detected even at extremely 
low rates of stepping. An important limitation of the single-molecule motility a.s- 
says is that one can in most cases only observe the motor while it is bound to 
the track, i.e. any conformational changes taking place while the motor is not 
bound are usually not accessible. Furthermore it is always necessary to filter out 
Brownian noise when aiming for nanometer position resolution, either by low-pass 
temporal filtering or by ensemble averaging over many equivalent events. In any 
Cctse one can therefore only observe the states in which the motor spends sufficient 
time, i.e. the states before the rate limiting transitions. The state one can stretch 
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at will is ATP binding, by lowering the ATP concentration. At saturating ATP 
concentration the rates are often so fast that detection becomes challenging. For 
example, the maximal rate of stepping for kinesin is about 100 per second. At high 
ATP concentrations one can still slow down load-dependent steps by holding back 
on the motor. While one can observe the progress of the motor from binding site 
to binding site with a processive motor such as kinesin, this is not possible with a 
single non-processive motor such as myosin II, because it unbinds from the track 
after exerting force. Modified in vitro motility assays were developed that made 
it possible to detect single disconnected force generation events with myosin II 
[14, 56, 92] and the kinesin related ncd [10] in a so-called “three-bead” geometry. 
In this assay the track, an actin filament or a microtubule, is suspended with the 
help of two optical traps over a bead connected to the substrate. The stationary 
bead carries a motor. In this geometry the reaction partners can not diffuse apart, 
and what is observed are the displacement or the force exerted by the motor on 
the suspended filament, i.e. the power-stroke. The myosin molecule is in a different 
conformation at the end of this power-stroke. In other words, what is detected is 
the difference in the internal molecular conformation of the motor before and after 
the power-stroke. This is in contrast to the processive kinesin where the dimer 
is more or less in the same internal state after the step, possibly with the two 
heads interchanged, and, although they surely occur, so far no intramolecular con- 
formational changes have been convincingly detected. An intrinsic problem with 
measuring power-strokes is that the actual displacements or forces measured in 
the assay depend on a variety of boundary conditions, involving elastic compli- 
ances in the molecule and factors such as the length of lever arms, even when the 
load on the motor is kept constant. It was for example found that the amount of 
displacement observed is monotonously decreasing with the strength of the optical 
traps holding the filament [15]. When observed with weak traps, the displacement 
was measured to be proportional to the length of the lever arm of the molecule, 
the neck or light-chain binding region, which can be genetically manipulated [73]. 
Obviously, there is no direct connection between the displacement generated at the 
end of the motor opposite to the binding site with the track and the periodicity 
of the track itself. Consequently it is not expected that the power-stroke is in any 
way commensurate with the track. Three-bead assays with non-processive motors 
offer the possibility to detect at least two specific time points in the mechanochem- 
ical cycle, namely the time of binding and the time of release, usually detected 
by monitoring the variance of the probe motion. The power-stroke can occur too 
close in time to one of those time points to be resolved separately. This appears 
to be the case for myosin where it occurs within milliseconds after binding. For 
the ncd motor the power-stroke occurs about 100 milliseconds before unbinding 
and one can therefore detect three time points in the cycle. Some myosins were 
found to produce a substep during the bound interval [93]. One has to be careful 
to determine an appropriate reference level for displacement measurements during 
power-strokes. In the transition from unbound to bound state the reference level 
can not be measured for an individual event since it is only known within the 
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rather broad range of Brownian motion. In this case only ensemble averages are 
meaningful, and for myosin II the average power-stroke has an amplitude of about 
5 nm [38]. Once the motor is bound, the reference level is better established, such 
as for the processive 8 nm steps of kinesin or for the substeps in myosins I, as well 
as for the power-stroke of ncd. 

A closer look at the statistic of binding/unbinding or at the time intervals between 
steps for processive motors can reveal information about the underlying chemical 
pathway. At low ATP concentration one can determine if one ATP or more are 
necessary for one event, one ATP leading to a Poisson distribution of intervals, as 
was seen for kinesin [86, 76, 78] and ncd [10]. At high ATP concentration kinesin 
showed two roughly equal rate limiting steps, ascribed to the two heads acting in 
series. 

There are certain general limitations on micromechanical experiments which one 
needs to be aware of. First of all, as already mentioned, there is always Brownian 
motion, adding noise to the position or force signals. Second, there are always 
elastic compliances and viscous damping. The motor molecules themselves have 
elastic properties and experience internal and external viscous losses. In all the 
assays, the motors are attached to some solid surface via more or less elastic link- 
ers. In the three-bead assay, the track protein is attached to probe particles via a 
compliant linkage. Compliances in series between the motor and the probe used 
to monitor position have two effects [18]. First, if the probe itself is held by an 
elastic force, such as an optical trap, the amplitude of displacement will be at- 
tenuated. Second, the compliances together with viscous drag impose a low-pass 
filter on the detection, making it impossible to detect fast transients in the motor 
conformational changes. The stiffer all the attachments are, the faster and the 
less noisy will be the detection. The compliances in the attachment chemistry are 
often non-linear, making the system stiffer under an applied load. On the other 
hand, for the displacement resolution it plays no role how stiff the probe is, i.e. 
how strong the laser trap is, because the signal- to- noise ratio does not depend on 
probe stiffness [18]. The most relevant parameter is the viscous drag on the probe. 
A small probe together with appropriate low-pass filtering decreases the noise. 

A further sophistication in the experiments is the use of feedback circuits to control 
the force the probe exerts in response to the activity of the motor. One possibil- 
ity is a ’’position clamp” [18]. A classical experiment with muscle fibers is to use 
’’isometric” conditions, which means that feedback is used to modulate the force 
on the fiber in such a way that the length of the fiber remains constant. In a 
single-molecule motility assay this translates to keeping the position of the probe 
constant. What this means for the motor is a rather subtle question and depends 
on several parameters characterizing the system. First of all, the feedback loop 
always contains low-pass filtering both, to keep it stable and intrinsically, for ex- 
ample, through the viscous drag on the probe. The effect of this is that feedback 
is only in effect for slow enough dynamic events. Second, with series compliances 
in the connection between motor and probe, the amount by which the motor or 
parts of it can move depends on the relation between the various compliances. For 
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myosin in a three-bead assay the amount of compliance could make the difference 
between completion of the power-stroke while stretching a soft series compliance 
or preventing the power-stroke with stiff compliances. 

The other possible feedback implementation is a “force clamp” [18], where the 
optical trap holding the probe is moved so that the force exerted on the motor 
remains constant. This has been done in kinesin bead assays [79], eliminating the 
necessity to correct for the effect of series compliances under changing forces, and 
allowing force-velocity curves to be measured with good statistics. 

In many in-vitro motility assays force is exerted on the motor in some way. In 
that case it is certainly relevant not just how much force is exerted, but in which 
direction it is exerted. The primary distinction usually made is between forward 
and backward force, backward force slowing or stalling the motor, forward force 
possibly accelerating the motor. Force-velocity curves have been measured mostly 
under backward forces [86, 79] and their shape can give clues about the load- 
dependent transitions in the mechanochemical cycle. It is not unlikely, though, 
that the exact direction of force is important, not just the rough distinction in 
backward and forward. There are experimental indications that an upward com- 
ponent of the force on kinesin might accelerate the motion [17] . There also is an 
asymmetry between the motions of kinesin under left and right lateral forces [79]. 

4.1. Some Open Problems 

The most burning open questions about cytoskeletal motor dynamics which await 
experimental answers are the following. For myosin II there persists a debate about 
the amplitude of the power-stroke and its dependence on lever arm length, cis 
well as a possible involvement of the actin filament in the force generation. Some 
studies propose a loosely- coupled mechanisms in which the conformational change 
is generated by a series of steps per ATP molecule hydrolyzed adding up to a 
displacement large compared to the dimension of the head [37]. 

Although it is almost undisputed that kinesin needs coordination between the two 
heads to move processively [21, 26], it is still unclear how this coordination is 
achieved and how the two heads move, in a (symmetric or asymmetric) hand-over- 
hand or in an inchworm motion. There is no reliable direct evidence for substeps 
yet. It is also not clear yet what determines the varying directionalities in the 
diverse myosins and kinesins. 



5. The Theoretical Approach 

Recent years have seen a variety of theoretical models for molecular motors, exten- 
sively reviewed in [3, 45, 71]. Here, we would like to clarify some selected concepts 
on which the mathematical description of motor proteins is based. We will review 
a simple example of a two-states model based on which we will highlight the role 
of fluctuations in the mechanochemical transduction operated by motor proteins, 
and discuss the concept of efficiency. 
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5.1. The Real System 

The dynamics of macromolecules such as proteins cover an extremely wide range 
of time scales, starting from picoseconds when single bonds rearrange and reach- 
ing hours or years for large-scale, slow (glassy) rearrangements, escapes out of 
metastable states. Functionally relevant dynamics of proteins occur on all of these 
scales. The dynamics of motor proteins that are crucial for force generation involve 
subdomains of 10s of nanometers in size and occur typically on a microsecond 
scale^’. For coarse-grained modelling it appears safe to make use of the separation 
of time scales, neglecting the fast (i.e. very rapidly thermally equilibrated) intra- 
domain dynamics, and to focus on the slow dynamics of larger structural units, 
such as certain crucial alpha- helices, the neck or the lever arm. This leads to a 
situation where, despite the complexity of their three-dimensional structure, mo- 
tor proteins can be described by a relatively small number of degrees of freedom. 
This number, which can be estimated from structural data (see also section 3), 
is on the order of tens to one hundred, small compared to the number of atoms, 
or even amino acids composing the protein. It is intuitively attractive to visualize 
the parts involved in the large conformational changes of the protein as elements 
of a macroscopic engine (springs, pistons, levers etc., connected by joints) with 
complex mechanical properties and mutual interactions. At the catalytic core of 
the motor domain (engine), the ATP (fuel) is cyclically hydrolyzed (combustion) 
and provides the energy necessary to drive the whole mechanism. 

At this level of description, a theoretical approach would require the definition 
of a complex multidimensional ” mechanochemical” Lagrangian or an “effective 
potential” defined by the mutual interactions among the elements of the engine, 
the track and a driving term associated with the fuel consumption. The motor 
dynamics could then be mathematically written in terms of a complicated system 
of coupled partial differential equations, describing all the (on the order of 100) 
degrees of freedom involved. With skill, luck and a powerful computer, it might 
be possible to compute the trajectory of the motor in three-dimensional space, its 
position along the filament, and the specific nucleotide bound to it. One might 
hope to understand motor protein function in (almost) microscopic detail in such 
a manner. 

There is, however, one fundamental problem with this approach. Motor proteins 
as well as other enzymes are rather different from macroscopic engines in that they 
are extremely small systems, working in watery environments at around room tem- 
perature. Under such conditions the “engine dynamics” is strongly influenced by 
thermal fluctuations and other stochastic properties of the environment like for 
example the availability of fuel (i.e. energy is available only whenever the motor 
encounters a diffusing ATP molecule). 

The forces exerted by the directed conformational changes of a motor protein are 



^’Techniques such as ab initio computation or Molecular Dynamics can describe up to nanosecond 
dynamics, but not yet these relatively slow and collective dynamics of motor proteins. 
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probably somewhat larger, but roughly of the same order of magnitude as the ran- 
dom forces exerted by the surrounding medium. This means that an analysis in 
terms of “macroscopic” classical mechanics is not appropriate, and it is necessary 
to describe motor proteins in terms of stocha^stic processes and thermodynamic 
variables (keeping an eye on the overall non-equilibrium character of the systems) . 
Thermodynamic variables, such as free energy or chemical potential are only de- 
fined and meaningful as averages over statistical ensembles, so it is important to 
keep in mind that one can not talk about the free energy difference in a single 
molecule conformational change. In other words, the system, i.e. the motor pro- 
tein can always borrow significant amounts of energy from the reservoir or may 
have to pay back those amounts at any given moment in time. 

The task for modelling now becomes to marry a mechanochemical description of 
the degrees of freedom with the stochastic aspects of the process. This has been 
approached in a few different ways, all of which have drastically further reduced 
the motors’ degrees of freedom to focus on essential properties. As a first step it 
is useful to look at the most important physical properties and limitations which 
are consequences of the small size of the motor and the characteristics of its inter- 
action with the cytoskeletal filament. 

First, motor proteins work out of thermodynamical equilibrium: the ATP hy- 
drolysis, ATP ^ ADP -f Pi, cyclically provides on average a free energy of 
~ 15 — 25 kuT. This energy is considerably larger than the energy associated 
with the thermal fluctuations acting on a Brownian particle in thermal equi- 
librium with its surrounding environment (1 k^T ~ 4 pN • nni). The available 
energy can be measured in terms of a chemical potential difference defined by^ 
Ap = pa'i'p — k'ADP — l^p, = Apo -I- k}^Tln(\ATP\/[ADP][Pi\). Here, the use of 
a chemical potential Ap is consistent with the fact that motor proteins are ther- 
modynamically open systems and the energy available for their motion depends 
also on the entropic contributions of the nucleotide and product concentrations 
[ATP], [ADP] and [Pi] in solution. Indeed, the equilibrium of the hydrolysis reac- 
tion is controlled by these concentrations and, therefore, also the average behavior 
of the motor. In the presence of a large excess of ADP and Pi over ATP, the mo- 
tor would run backwards^. Implicit in the use of thermodynamic variables is the 
assumption that the motor works in the presence of large numbers of ATP, ADP 
and Pi molecules (chemical reservoir). Under the normal conditions in a cell an 
excess of ATP, i.e. Ap > 0, breaks the equilibrium of the chemical reaction and 
makes it proceed more or less irreversibly in the direction of hydrolysis of ATP 
into ADP and Pi. 



^A/io is the chemical potential difference in standard conditions. 

^Note that the backward reaction is practically impossible to realize just by changing concen- 
trations since an extremely high concentration of ADP and P\ in solution would be required. 
The equilibrium constant for the ATP hydrolysis is [ADP]eq[Pi]cq/[ATP]cq = M\ [26]. The 
backwards reaction does occur in the case of motors coupled to another engine like, for example, 
in the case of the FoFi-ATPsynthcise. The free energy necessary to drive the chemical reaction 
in the direction of ATP synthesis is supplied by the proton-driven Fo part of the complex. 
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Second, the motor-filament interaction is polar (see section 2). The polarity of the 
track filament causes the (non-symmetric) motor protein to bind in a well defined 
orientation, which in turn will direct conformational changes, such as for example 
lever arm power strokes in a certain direction with respect to the filament. While 
the non-equilibrium condition unbalances the ATP hydrolysis reaction allowing 
the motor to dissipate energy, the polarity of the motor-filament interaction is 
necessary to bias the motor translocation in space^. 

Third, the dynamics is overdamped: at the nanometer scale with velocities a^s they 
occur here, viscosity dominates over inertia (i.e. the Reynolds number is much 
smaller than 1) [68, 26]. 

Fourth, the motor is in contact with a thermal bath, and from estimating time 
scales it turns out that it can only work as an isothermal engine local tem- 
perature gradients cannot be used to drive the motion and a thermodynamical 
description based on some type of Carnot cycle, i.e. operating between heat reser- 
voirs of different temperature, is not valid [45, 62]. 

6. Mathematical Modelling 

In the previous section, we outlined the general physical boundary conditions 
within which motor proteins function. Any mathematical modelling needs to con- 
form with these. Both Brownian conformational fluctuations as well as the stochas- 
tic character of the chemical transitions have to be taken into account. Indeed, it 
is hard (and probably misleading) to distinguish between fluctuations that act on 
spatial degrees of freedom from those that act on chemical ones. 

Coming back to the high-dimensional potential energy landscape introduced in the 
last section, we are therefore now trying to model the progress of the fluctuating 
system point across this landscape. To be able to treat this system computation- 
ally, one has to treat, in any case, ATP and the hydrolysis products statistically. 
It is impossible to explicitly keep track of the large number of those molecules 
in solution. One therefore needs to define chemical reaction coordinates (in most 
models just one) describing the progression of the chemical cycle in a stochastic, 
thermodynamic manner^ ^ [67, 52, 66, 34] (and for more references [45, 71]). For 
clarity at the computational level, many models further simplify by drastically 
reducing the number of mechanical degrees of freedom. There are different possi- 
bilities to do that. 

■T>om thermodynamics we know that the longtime behavior of a Brownian particle in a homo- 
geneous environment is diffusive and symmetric, i.e. the average velocity is zero. A macroscopic 
velocity can only appear when both time reversal symmetry t ^ —t and parity symmetry in 
space X —X are broken (’’Curie’s Principle”) [8, 67]. 

^^^Thermal fluctuations at such a length scale relax with characteristic times much shorter than 
the characteristic time of the ATP hydrolysis cycle [62]. 

Standard chemical kinetics models do not describe the spatial degrees of freedom of the species 
concurring in the chemical reaction. Here we are explicitly interested in the conformational 
changes of the molecules and theoretical models should couple spatial and chemical degrees of 
freedom, extending the concept of “reaction coordinates” to a mechanochemical process. 
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The number of conformational and translational degrees of freedom of the motor 
can be reduced by some kind of averaging or projection process, in most existing 
models all the way to one translational degree of freedom along the track. This 
could be the position of the center of mass of the motor, or the position of a probe 
attached to the tail, etc. 

An obvious problem with this simplification is that degrees of freedom which may 
be important for the mechanical cycle, but are orthogonal to the chosen projection, 
will be ignored. For example, the trailing head of a dimeric processive motor such 
as kinesin might move forward without displacing the load carried by the leading 
head. 

The multidimensional effective potential energy landscape is now “reduced” to a 
purely two-dimensional profile spanned by one “chemical” and one “spatial” co- 
ordinate. Its shape can be conveniently pictured as a landscape of valleys, hills, 
plateaus and saddles which are periodically repeated since the motor cyclically 
hydrolyzes ATP and moves on a periodic track, see Fig. 3. In this picture, diflFu- 




Figure 3. Highly simplified two-dimensional energy landscape 
for a motor protein with a chemical and a spatial axis (arrows). 

An external load would tilt this landscape in the spatial di- 
rection, a chemical non-equilibrium in the chemical direction. 
Mechanochemical coupling is reflected in the diagonal path (black 
line) a particle (black sphere) would advance along the spatial axis 
even with a purely “chemical tilt” . (For colored picture see color 
plate 4). 

sive parts of the motor path are represented by close-to- level valleys or plateaus 
which the system can cross on thermal energy alone in a reasonable time. Directed 
motion or unbalanced chemical reactions occur across steep parts of the terrain 
(local saddles with heights above the valley bigger than kuT). The fact that the 
motor is operating under chemical non-equilibrium conditions (A// > 0) and pos- 
sibly in the presence of an external load (/ext ^ 0) is represented by a global 
downward tilt in the chemical direction and possibly the spatial direction respec- 
tively. In the absence of mechanochemical coupling, the motor, now represented 

These are characterized by average slopes of the order of over characteristic distances of 
a substantial conformational change performed by the motor. 





164 



A. Parmeggiani and C.F. Schmidt 



by a stochastic particle moving in this potential landscape, could progress along 
the chemical or mechanical axis alone, if driven by A/i or /ext, respectively. The 
presence of the mechanochemical coupling, however, is reflected in the structure 
of the landscape, roughly speaking forcing the particle to move diagonally even 
if driven only by a chemical or only by a mechanical force. In other words, the 
effective potential has saddle points which are obliquely oriented with respect to 
the chemical and spatial coordinate axes. 

Recognizing that the reaction is cyclical, except for a steady decrease of the ATP 
concentration in the (large) reservoir, the “chemical tilt” can be represented as a 
reaction cycle in which the rates do not obey detailed balance, i.e. are biased in 
one direction around the chemical cycle. The “mechanical tilt” generated by an 
external load then modifies the characteristic rates of the conformational changes 
driven by the ATP hydrolysis. 

Going even further in simplification, one can discretize one or both of the remain- 
ing two degrees of freedom and introduce chemical and/or spatial “steps”. 

We will focus in the following on a model with a discrete number of chemical 
states combined with a single continuous spatial variable^'^ [67, 6], which could 
be, but does not need to be fixed to be the motor’s center of mass. The notion 
of a chemical ”5^a^e” [51, 23] (see also [46]) is usually tied to the experimentally 
identifiable intermediate states of the ATP hydrolysis cycle. The chemical reaction 
coordinate is then represented by a discrete chemical variable i = 2, N {i is the 
index of a chemical step). The states can be seen cts the projections of the free en- 
ergy minima onto the chemical coordinate. While the actual number of measurable 
intermediates may be higher, the simplest case that reproduces mechanochemical 
coupling is A = 2 [67, 6]. The motor movement is ’’reduced” to the overdamped 
and stochastic dynamics of a particle that moves in N different (one-dimensional) 
free energy landscapes with transitions between states that take place with char- 
acteristic rates (each transition corresponds to a specific substep of the chemical 
reaction or to a purely thermally activated event [46]). The use of a continuous 
potential energy landscape along the spatial coordinate allows an easy description 
of the effect of an external load on the motor, although the possible importance of 
the exact spatial direction of the external force cannot be described in detail by 
the one-dimensional projection. The polarity of the motor- filament interaction is 
described by the breaking of the spatial inversion symmetry, x ^ — x , by choos- 
ing, for example, at least one of the N potential landscapes with an asymmetric 
profile. In the model we further assume that the transitions between states occur 



^'Alternatively the spatial variable can be also be made discrete since the filament has a periodic 
structure and the conformational change of the motor has a size comparable to the size of the 
motor domain itself. In this case, the displacement of the motor is described in terms of kinetic 
rate equations where particular chemical intermediates are coupled to specific locations along 
the mechanical axis. The transition rates can then be dependent on the load applied [34, 39, 50]. 
Discrete or continuous models have not to be considered antithetic. It is a matter of definition of 
the coarse-graining or the continuous limit procedures to pass from one to the other description 
[50, 34]. 
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without a displacement of the motor: in other words, the chemical reactions are 
assumed to be “load independent”^^. There still is a coupling of the chemical reac- 
tion to the external load, but it occurs in a different way (see below). Next comes 
a design decision about the importance of Brownian conformational fluctuations 
in the mechanical cycle, i.e. a decision on what shape of potential profile to con- 
sider. The presence of “diffusive segments” in the cycle can in principle be tested 
experimentally. In one extreme, the purely diffusive scenario, one can assume that 
the free energy derived from the ATP hydrolysis is used only to lock-in particular 
thermal fluctuations once they occur, in the other extreme one can assume that the 
free energy is used to produce a directed force generating conformational change, 
and that Brownian fluctuations play a role only under specific conditions (for ex- 
ample, when the motor is under load and close to stall). The first case is that of 
the original ’’thermal ratchets” which rectify Brownian motions [13, 32, 90], the 
second case corresponds to the now widely accepted ’’power-stroke model” for the 
action of myosin [32, 33] (see also [45, 71]). Real motors probably work in between 
these two extreme situations, with some parts of the force- or motion-generating 
mechanisms relying on diffusive transport and others on conformational changes 
driven by the steep local potential slope. 

In the following we describe as an example a two-state model inspired by the pro- 
cessive mechanism of conventional dimeric kinesin [21, 62, 63]. 

The cooperative and alternate action of the two heads is described by the reactions: 



C| + ATP 


tti 


C 2 + ADP + P 


C 2 + ATP 


a-2 

/?i 


Cl + ADP + P 




/^2 





C\ and C ‘2 are two particular configurations of the motor during its interaction 
with the filament: for example, C\ is the configuration with the first head attached 
and the second head detached, while in C 2 the role of the two heads is exchanged, 
see Fig. 4. The ATP hydrolysis drives the transition between these two configura- 
tions and the motor performs a typical step of //2 per ATP molecule hydrolyzed 
(tight-coupling). We assume here an exact coordination between the two heads: 
the attachment of one head corresponds to the detachment of the other. 

Further assuming symmetry between the two identical heads, this dynamics is de- 
scribed by the stochastic motion of a particle in two identical potential landscapes 
shifted in space (ISS model) [62], see Fig. 4. Two coupled continuity equations 
describe the evolution of the probability densities, P\{x,t) and P‘ 2 {x,t), to find 



^^This can be rationalized assuming that the chemical reactions are fast compared to the motor 
conformational changes. In reality this is not necessarily the case. The conformational change 
can be the precondition for a chemical step, for example product release (see section 5.1). 
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the motor protein in the state 1 or 2 at time t and position x [6, 67]: 
dtPi+dxJi = -iL>i{x)Pi LU2 {x)P2 

dtP2pda:J2 = UJl{x) Pi ~ UJ2{x) P 2 ( 2 ) 

(note that the total probability is conserved and normalized to 1). 

Assuming periodic boundary conditions (the potential is periodic as is the track), 
the probability currents, Ji = ^[— — (9xFPi — /ext)]7^2, describe the stationary 
dynamics (i.e. when dtPi = 0) along the spatial direction x, characterized by a 
diffusion term and the driving force of the potential, dxWi^ in the presence of an 
external load, —/ext, applied parallel to the filament (in an experiment this could 
be for example the force applied by a laser trap). 

Here, /x is a “microscopic” mobility T is the temperature, ks the Boltzmann 
constant. W\{x) and W 2 {x) are the periodic and asymmetric free energy profiles. 
The transition rates uj\ and lj 2 are source terms that depend on the ATP hydrolysis 
reaction scheme (1). They are constrained to be non- zero only in the vicinity of 
the minimum of the potentials W] and W 2 to satisfy the notion of allostery in 
enzymology ^ uo\ and 002 can be directly computed from the previous reaction 
pathway as: 

a;, =«,+/?,= 

=a2 + /3‘2= + (3) 



where the 1-periodic function d 2 {^^) is non-zero and equal to lj only for l — d < x < 1. 
By symmetry l3\ (x) = a 2 (x — 1/2). These transition rates can be rewritten in terms 
of the ATP concentration by substituting and 

Spontaneous motion (non zero average velocity at zero external force) or genera- 
tion of force occur when, in presence of a spatial asymmetry, the chemical potential 
difference A/x = p ~ k'ADP ~ k'P, is non-zero (i.e. there is an excess of ATP). 
In this case the transition rates uji and uj 2 do not globally satisfy the “detailed 
balance condition” [67]: uj\{x)/uj 2 {x) / exp[(VPi - W 2 )/kBT]. The excess of ATP 
is necessary to increase the rates uji and U 2 which otherwise would be extremely 
small due to the potential barrier \W 2 — W\ \ at the transition regions, Eq. (3) (see 
also Fig. 4). Once it has jumped into the new chemical state without a change in 
the spatial coordinate, the particle experiences a driving force equal to the poten- 
tial slopes d,iWi and performs, on a slower time scale than the chemical transition, 
a step of //2 with average velocity: 



V — 




+ J2) 



( 4 ) 



microscopic mobility is a phenomenological parameter. It summarily describes a number 
of unknown features of the motor protein dynamics and conformational changes. 

^®The transition driven by the ATP hydrolysis is activated only when the protein/track com- 
plex assumes a specific conformation, and this is the “backdoor” through which load dependent 
chemical rates enter the scheme again. 
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Figure 4. Schemes of the hand-over-hand mechanism (left) and 
its translation in terms of a two-state model (right). The different 
phases of the coordinated activity of the motor domains are re- 
duced to the motion of a stochastic particle at position x and in 
chemical states 1 or 2 (see text). The phase A corresponds to the 
transition driven by the ATP hydrolysis (hydrolysis and energy 
release), while the phase B to the sliding of the particle over the 
potential profile (conformational change). For colored picture see 
color plate 5. 



and a non zero rate of ATP consumption r, i.e. the average number of ATP 
molecules hydrolyzed per unit of time: 

r=[ dx[{ai{x) - (3i{x))Pi{x) - {a 2 {x) - (32 {x))P2(x)] . (5) 

Ji) 

6.0.1. Model Parameters Estimated from Experiments with Kinesin The poten- 
tials W\ and W 2 are assumed to be piecewise linear. This is justified for most pur- 
poses, since a well-defined instantaneous velocity can not be measured for a single 
molecule in any case. The only meaningful concept would be an ensemble averaged 
velocity, which could preserve local shape details in the potential, at least on an en- 
ergy scale comparable to or bigger than ki^T. However, since we already “crudely” 
simplified the physical reality by projecting all spatial degrees of freedom onto one 
coordinate, it is reasonable to simplify a little further. The actual potential shape 
will be important close to stall. This special case will be discussed later. While the 
assumption of piecewise linear potentials may be simplistic, it allows us to obtain 
a simple and consistent estimate of the potential amplitudes VF ~ 20 /c^T by cal- 
culating the product of the typical step performed by the motor, 1/2 = S nm, and 
the stall force /^xt ~ 6 — 7 pN measured in experiments [85, 97]. Under the same 
assumptions, a microscopic mobility can be estimated as ~ 120 nm/pN • s from 
the maximal velocity produced by the motor r’max ~ 1 p^m/s and the potential 
slope W/l, by using the approximation Vuuxx — pdxW{x). Note that the effective 
friction ^ = l/p which we derive from the value of p is 10^^ — 10"^ times larger 
than the friction of a bead of 10 nm size in water. The viscous drag in water even 
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of a typical bead (0.5 / 2 m) used in experiments to track the motor movements, 
is negligible compared to this effective ’’viscosity” due to the internal properties 
of the motor and its interaction with the filament. This viscosity should not be 
taken literally as characterizing the surroundings of the part of the protein per- 
forming conformational changes. Although under zero load conditions obviously 
the entire free energy provided by ATP hydrolysis is dissipated, this does not have 
to occur at the average velocity assumed above. One can also imagine that the 
motor makes much more rapid conformational changes in a low viscosity medium, 
but has a rate limiting energy barrier to pass, which sets the maximal turnover 
rate. In any case, the maximal speed at which the motor moves is likely to be 
adapted to the very dense environment, the cytoplasm, through which kinesin has 
to move loads. Handwavingly, by “impedance matching” the motor can work more 
efficiently when waiting for the viscous stresses generated by its motion to subside 
before attempting the next step. 

We estimate the parameter lj 0.1 s~^ • M~^ from the [ATP] dependence of 
the measured velocity [97] assuming that, at low ATP concentration, the average 
velocity v is limited by the transition rates uji. 



6.1. ‘‘Macroscopic Approximation” and Efficiency vs. Load 



6.1.1. Definition of Efficiency Using the quantities introduced in Eqs. (4) and 
(5), we can define the mechanical efficiency of the motor protein self-consistently 
and without violations of the First and Second Law of Thermodynamics. Energy 
conservation is maintained when the average energy provided by the chemical 
reaction per unit tinie*^, rA/j, is partially converted into mechanical work per unit 
time done by the motor against an external force, and partially dissipated 

in the thermal bath, 11: i.e. rA/i = —fvxt'^ T Ft. The average energy dissipated II 
must be strictly positive, FI > 0. The mechanical efficiency is then defined as the 
ratio: 






fvxtV 

rA/2 



(6) 



From this definition, r] is zero when the motor is unloaded {foxt — 0) or when the 
load applied to the motor is large enough to cause it to stall {v — 0) while the 
motor is still hydrolyzing ATP (r > 0). The situation of zero efficiency at stall 
can be rigorously proved to exist in the linear regime^^ (i.e. fcxtl^Ajj, k^T). 
The definition is completely independent of the model considered. It can be also 
proved to exist in the non-linear one when using particular configurations for the 
potentials [45, 62, 64] (see also next section). 

Note that the point (/^xt = 0, A/i = 0), which describes the thermodynamical 
equilibrium, is singular in our definition of efficiency. 



^^The motor works cyclically. 

^^The existence of a linear regime is a non-trivial consequence of the fact that the motor is 
isothermal: the linear regime in which a motor could theoretically work, although in reality it 
does not seem to be the case for cytoskeletal motors, is defined els the regime of energies small 
compared to ksT or forces small compared to the “thermal kicks” kuT jl. 
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From the definition (6), the efficiency is bounded by 0% and 100%. The upper 
bound can be reached in particular regimes [62, 64]. 

It is important to note that, contrary to thermal Carnot engines (in which the max- 
imal efficiency is obtained approaching equilibrium, i.e. in the quasi-static regime)^ 
in isothermal nanoscopic engines driven by a chemical reaction a maximal mechan- 
ical efficiency can be achieved far from equilibrium (i.e. in a totally irreversible 
regime^'^) [62], precisely in the regime where motor proteins operate. For example, 
at a given chemical potential difference, the efficiency r] always reaches a maximum 
value placed between the futile dissipation of ATP energy at stall and the viscous 
loss associated with the conformational changes at maximum velocity and zero 
load. The situation is similar in an electrical motor. At maximum velocity and 
zero load the friction in the bearings dissipates all the electrical energy delivered 
to the motor. At stall with maximum current the finite resistance of the motor 
windings still causes a voltage drop between the terminals and therefore dissipa- 
tion through heating of the motor. 

The role of thermal fluctuations acting on the motor proteins can be highlighted 
by studying the efficiency as a function of external load. For this purpose it is 
useful to first discuss a macroscopic approximation and to see how it differs from 
the exact result obtained by solving Eqs. (2). 

The model described in Fig. 4 has a simple interpretation in terms of two char- 
acteristic times: the transition time, Tt (see the phase A in Fig. 4), and the drift 
time, Td (see the phase B in Fig. 4). These two times can be estimated assuming 
that the motor behaves as a particle in macroscopic mechanics (i.e. no fluctua- 
tions) moving in a time Td over a distance 1/2 without inertia under the total force 
and jumping to the other state after a time T/ . The drift time is defined 
by Td — l^/2ii{W + lfvxi)-> while the transition time is defined by n = l/u;[ATP] 
(i.e. the inverse of the transition rates uj\ or a;^). Given these two times, we esti- 
mate the average velocities v ^ v — l/2{rd + Tt) and r ^ r = l/2{Td + Tt) and 
the efficiency rj ^ f} = - /extT/f^A/x. At high ATP concentration, n < r^, and 
V ~ l/2Td, while at low ATP concentration, T/ > Td, and v ~ l/2Tt. 

What is now the effect of fluctuations on the efficiency rj in comparison to the 
’’macroscopic mechanics” approximation fjl In Fig. 5(a), the continuous curves 
represent the efficiency rj as a function of the external load at low and high ATP 
concentrations‘^^\ The results were obtained from the analytical solution of Eqs. 
(2) for the ISS model [65]. The dashed curves represent the macroscopic approxi- 
mation of the efficiency r). The macroscopic estimate neglecting fluctuations works 



^•T^or finite values of the velocities v and r and the generalized forces fcxt ^ ksT 1 1 and A/x ^ 
koT. 

^•^T^om Fig. 5(a) we read off an estimate of 40% for the efficiency of kinesin at high [ATP]. 
Efficiencies of myosin If have been estimated by calorimetry in muscle [9, 24] to be at least 50%. 
A direct mecisurement on single motor proteins is not yet available. Direct measurement of the 
ATP consumption r is very difficult, especially since the crucial events decreasing the efficiency 
at increasing load would be futile ATP hydrolysis events or backwards steps. An exception is the 
efficiency of the rotary motor Fi-ATPase [99], which has been estimated to be close to 100%! 
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Figure 5. Load dependence of the mechanical efficiency ry (a) 
for different ATP concentrations (b) for different potential am- 
plitudes W at [ATP] = 1 mM. In figure (b) the amplitude W 
has been normalized by the slope of the potential W /I which de- 
termines the value of the stall force. Continuous lines are results 
obtained by the analytical solution of the ISS model, while the 
dashed lines are approximations obtained by using the character- 
istic times T(i and T/ (see text). The model parameters not cited 
in the text are the asymmetry coefficient a/ 1 = 0.2 and the width 
of the localized transition d/l = 0.05. 



fine up to intermediate values of the load (~ 3.5 pN). In this situation, the driving 
of the potential dominates over the external forc:e, ^J.W^ > fcx\^ and the particle 
has always a probability close to one to reach the transition region and to change 
state with a characteristic rate LUj. When a load above 4 pN is applied, the approx- 
imation begins to fail for both values of the ATP concentration, not predicting the 
maxima in the efficiency and the subsequent decrease to zero at stall. 

This is due mainly to the fact that, in the presence of a relatively high load, the 
particle gets within kuT of the top of the barrier. In other words, at high load the 
probability, that a thermal fluctuation (energy borrowed from the heat bath) helps 
the particle to overcome the top of the potential Wi after a transition, increases. 
Neglecting a possible detachment of the motor from the filament [63], this event 
corresponds to a “backward step” of the motor, meaning that all the chemical 
energy of the preceding ATP hydrolysis is dissipated in friction down the positive 
and steep slope of the potentials Wi [62]. The motor stops to advance while it 
continues to consume ATP (r > 0) and the efficiency p drops to zero. 

This interpretation is confirmed by Fig. 5(b) showing that the maximum value of 
the efficiency strongly depends on the potential amplitude IT, ranging from close 
to zero to close to one. In other words, a high W amplitude compared to kf^T 
prevents the possibility of backward steps, and the range of applicability of the 
macroscopic approximation increases. Predictably, even for potential amplitudes 
largely outside the range motor proteins can reach {W = 400 ksTl)^ the efficiency 
p eventually drops to zero at the stall force value. In a microscopic system subject 
to thermal fluctuations it is in principle impossible to neglect the fluctuations at 
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a load close to the stall. At some point the load will pull the system into a state 
where the fluctuations dominate the dynamics and the motor can not truly stand 
still without consuming chemical energy. This is the reason why muscles get tired 
supporting a static load. The best an evolutionary design can do is to slow down 
transition rates to decrease ATP consumption while the efficiency is zero. This 
can be seen, for example in smooth muscle, where the function is rather to hold 
tension than to rapidly generate it. Smooth muscle myosin hydrolyzes ATP about 
10 times slower than skeletal muscle myosin. 



6.1.2. The Usefulness of Mathematical Models. We have seen that motor proteins 
operate as microscopic mechanochemical transducers. They are microscopic sys- 
tems in which mechanical and chemical degrees of freedom are intricately coupled. 
Such systems can not possibly be described using a full mechanical description of 
all the many degrees of freedom in a fluctuating environment, nor is it possible to 
treat the small reaction partners, ATP, ADP and Pj in anything else than a ther- 
modynamic approximation. The price for a severely simplified description such as 
the one we described above is that the model entirely neglects the actual three- 
dimensional dynamics of the protein by describing only one degree of freedom in 
space. Evidently, there are gaps to fill. Nevertheless, a simplified approach makes 
it possible to understand “universal” properties of motor proteins, despite their 
biological diversity. As an example, we discussed the efficiency of motor proteins, 
and explained the generic feature of zero efficiency at maximum speed (i.e. at zero 
load) and at stall with a maximum in between. 

Furthermore, the coarse-grained approach used in typical models is expected to 
be particularly successful in understanding phenomena on a larger than single- 
molecule scale, such as collective phenomena involving many interacting motors. 
For instance, a two-state model was extended in order to describe a collective sys- 
tem of coupled motors [47, 48]. This approach provided a theoretical description 
of spontaneous oscillations that have been observed in purified muscles fibers [98] 
and recently it ha>s been shown that the same model can be applied to understand 
the oscillations of flagella and hair-cells, see for references [49]. Similar behaviors 
have been reproduced in models that generalised the power-stroke mechanism of a 
single motor [32] to a collection of them [11, 94]. Motors that collectively interact 
with cytoskeletal filaments can form patterns resembling those found in cells, such 
as the mitotic spindle [58, 12, 59]. Under the appropriate boundary conditions one 
can also model experimentally observed phenomena such as tension generation and 
contraction of cytoskeletal filament bundles (stress fibers, contractile ring) [41, 42]. 
Coarse grained modelling, applied to the dynamics of adsorption and desorption of 
dimeric motors to filaments, wa.s also successful in intepreting electron microscopy 
decoration experiments [95, 96]. All these successes are essentially baised on the 
fact that collective phenomena do depend less on the detailed properties of a single 
motor composing the system than on the global rules determining their interac- 
tions. 




172 



A. Parmeggiani and C.F. Schmidt 



7. Outlook 

All systems studied in biology are the result of an evolution of millions or billions 
of years. For the complete understanding of something as highly complex as motor 
proteins it will be neither sufficient to only develop a universal stochastic theory 
nor to only, say, solve all their atomic structures. Different approaches need to be 
combined. In doing that, it is important to recognize the limitations inherent in 
each approach. Only with that knowledge will the different approaches eventually 
converge to a detailed understanding of motor proteins. 
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1. Motivation and Hypothesis 

The increasing interest of materials scientists in biomineralisation phenomena is 
caused by the option of new biomimetic processing routes for materials with un- 
usual structures and properties. Among them new biomaterials for application in 
medicine are of great interest. In this paper we will discuss a new biomimetic 
approach for manufacturing of bone implants based on hydroxyapatite (HAP)- 
collagen composites. One characteristic feature of all biomineralisation processes 
is the control of nucleation and growth of the crystalline phase by interaction with 
a biomolecular template. When we consider the development of substitutes of hard 
tissue then the creation of a nanocrystalline structure of the mineral phase (HAP) 
in the new materials has to be the final goal because the nanocrystallinity is one 
precondition for a high potential for remodeling when bone resorbing cells (os- 
teoclasts) and bone forming cells (osteoblasts) are interacting with the material. 
From many biomineralisation studies it is known that the final inorganic structure 
formed on a biomolecular template or in a biomolecular compartment is the result 
of a competition between template induced nucleation versus inhibited growth by 
specific adsorption of biomolecules on certain crystal surfaces [1]. Thus the crys- 
tal morphology as well as the crystal size can be designed by using appropriate 
biomolecules. The development of artificial bone substitute has to be focused on 
two goals - to create a material with bone like mechanical properties, and to ful- 
fil the precondition for the intended remodeling by cellular activity. A possible 
approach for that seems to be starting from a basic material composed of the 
main components of bone - HAP and collagen - which should be modified with 
some noncollageneous proteins for governing the evolution of the inorganic phase 
as well as for activation the cell interaction. We previously described the miner- 
alisation of collagen I [2] by following a biomimetic route. It is known that the 
noncollageneous proteins involved in hard tissue formation possess an increased 
percentage of aspartic and glutamic acid. For example in bone sialoprotein there 
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are 19.3% glutamic acid, and 5.6% aspartic acid, or in osteopontin there are 9.1% 
glutamic acid and 16.1% aspartic acid. From this observation it has been concluded 
that the assembling and mineralisation of collagen could be controlled by adding 
polyaspartic acid or polyglutamic acid to a precusor consisting of collagen, calcium 
and phosphate ions. At pH of about 7.0 collagen fibrils with HAP nanocrystals 
have been prepared. These fibrils were used to manufacture HAP-collagen tapes 
as shown in Fig. 1. 

Osteoblasts are proliferating very well on these tapes. For instance in Fig. 
2 it can be seen how osteoblasts are growing on a laser structured tape. After 
three days already the first cells are observed on the back side of the tape growing 
through the 40 jim pores. 




Figure 1. HAP-collagen tape. 

Furthermore, HAP-collagen I composites have been prepared starting with 
the mineralised collagen fibrils and Biocement D (a hydraulic bone cement [3], 
Merck Biomaterial GmbH, Germany). 



2. Osteocalcin Control of the Mineral Phase 

In order to include the cellular activity for remodeling of the HAP-collagen scaffold 
a further biological functionalisation of the material would be favourable. There- 
fore, appropriate proteins should be immobilized on the HAP-collagen composite 
which are carriers of signals for a specifc activity of the osteblasts or the osteoclasts. 
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Figure 2. In-growth behaviour of mouse osteoblasts on an ex- 
cimer laser structured HAP-collagen tape; (left) laser structured 
tape; (right) back side of the structured tape after three days. 



One interesting candidate of such proteins is osteocalcin. In bone, osteocalcin (bone 
7 -carboxyglutamic acid protein, Gla protein) is the predominant non-collageneous 
protein with contributions up to 2% of the total protein in bone tissue. It revealed 
itself as a highly specific osteoblastic marker produced during bone formation [4]. 
The osteocalcin molecule contains 46-50 amino acid residues and has a molecu- 
lar weight of 5200-5900. Conformational studies showed that osteocalcin consists 
of a-helical and /3-sheet structures. Two antiparallel a-helices, the “Gla-helix” 
(residues 16-25) and the ‘‘Asp-Glu helix” (residues 30-41) are framed by ^-sheet 
structures. Because of its content of three Gla residues in each molecule, osteo- 
calcin has a strong Ca^^ binding effect. The carboxyl groups of Gla residues are 
the main binding sites for Ca‘^+. Therefore, it can be presumed that osteocalcin 
interacts with Ga^^+ in the HAP crystal lattice in bone also. Hauschka proposed 
a model which reflects the crystallographic correspondence of the osteocalcin and 
particular lattice planes of the HAP crystal [4]. T he very special property of the 
“Gla-helix” is the regular spacing of the three charged Gla residues at intervals 
of three or four residues (e.g. residues 17, 21, 24) with hydrophobic residues be- 
tween them. The Gla-residues in the a-helix have a periodicity of 5.4 A which is 
remarkably similar to the interatomic lattice spacing of Ca‘^+ in the (0001) plane 
of HAP (5.45 A). Hauschka suggested a specific adsorption of osteocalcin to these 
surfaces. An experimental evidence for this assumption has not been found until 
now. The important question is whether the crystal growth perpendicular to the 
(0001) HAP plane is inhibited by specific adsorption of osteocalcin molecules. A 
first evidence for such a specific adsorption can be concluded from studies with 
collagen-Biocement D composites. Biocement D supplemented with 2.5% miner- 
alised collagen shows a plate-like crystalline microstructure with a characteristic 
grain size in the range from 50-100 nm (Fig. 3). Addition of 0.0008% osteocalcin 
during mineralisation results in a significantly finer microstructure as shown by 
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Scanning Force Microscopy (SFM) of the grown crystal structure. Obviously os- 
teocalcin inhibits the growth of the HAP crystals by specific adsorption on the 
crystal surfaces. A more direct proof for the influence of osteocalcin on the min- 
eralisation has been given by the observation of the first stages of HAP growth in 
presence of osteocalcin. 




|jm pm 



Picture 3. 3D SFM-images of Biocement D-2.5% mineralised col- 
lagen I composites without (left) and with (right) osteocalcin. 



We developed a model system using an osteocalcin-controlled dissolution- 
reprecipitation process of brushite (DCPD, CaHPO.i * 2 H 2 O) to HAP [5]. Brushite 
crystals form a smooth (010) surface which allows the imaging of HAP formation 
on brushite. Dissolution-reprecipitation of calcium phosphates can be controlled 
by their pH-dependent solubilities [6]. At room temperature and pH values be- 
low 4.2, DCPD is the most stable calcium phosphate, whereas above 4.2 HAP 
is preferentially formed. When DCPD is exposed to a solution with a pH >4.2 
the more stable calcium phosphate HAP will reprecipitate. Although solubilities 
of DCPD and HAP are very low, the solubility of DCPD is higher then that of 
HAP under such conditions. Therefore, the HAP formation occurs very slowly. 
However, in presence of osteocalcin we found an accelerated nucleation. At the 
same time osteocalcin inhibits HAP crystal growth by adsorbing specifically to the 
(0001) face of apatite crystals. Direct imaging of the protein-crystal interaction was 
possible by SFM (Fig. 4). The experiments verify the idea of a strong crystallo- 
graphic coupling of osteocalcin adsorption and the HAP structure as developed by 
Hauschka [4]. From this study a four stage model has been derived explaining the 
observed dissolution-reprecipitation process, consisting of the stages (i) dissolution 
of brushite at pH > 4.2, (ii) osteocalcin stimulated nucleation of HAP-nuclei, (iii) 
osteocalcin terminated HAP-layer growth, (iv) growth stress driven patterning of 
the layered nanocrystals [5]. 
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Figure 4. Coverage of the (0001) surface of apatite with osteo- 
calcin, SFM images. 

3. Influence of Osteocalcin on the Osteoblasts Interaction 

It has been shown that osteocalcin is fulfilling a first task important for the in- 
tended remodeling - the formation of a nanocrystalline HAP phase. From that the 
question arises how the cell adhesion at these surfaces is changed by the presence 
of osteocalcin. In a first step we investigated the interaction of human osteblast- 
like SAOS-2 cells with collagen-Biocement D composites. The cells were allowed 
to adhere for 2 h on the collagen-Biocement D-composites without and with osteo- 
calcin. Round-shaped cells as well as spreading cells with filopodia can be detected 
on composites without and with osteocalcin (Fig. 5). 




Figure 5. Adhesion of an osteoblast in absence (left) and in 
presence (right) of osteocalcin (2 h after deposition). 

In Scanning Electron Microscopy (SEM) images it can bee seen that SAOS-2 
cells appear to be more fiattened spread on osteocalcin-containing biomaterial (Fig. 
5, right), whereas cells on Biocement D-collagen composite without osteocalcin just 
start to fiatten (Fig. 5, left). From this observation it cannot be concluded what 
mechanism could be responsible for the accelerated spreading, the nanocrystalline 
HAP structure, or the presence of the osteocalcin with specific signals for the 





182 W. Pompe, K. Flade, M. Gelinsky, B. Knepper-Nicolai and A. Reinstorf 



cells. Maybe another observation can give some hints. After 3 days SAOS-2 cells 
proliferated on Biocement D-collagen I composites without and with osteocalcin 
functionalisation in the same manner. No quantitative differences in adhesion could 
be determined by SEM imaging. This behaviour could be explained when we keep 
in mind that the osteoblasts are able to produce osteocalcin during these three 
days on the non-functionalized surface too. Thus it seems to indicate that the 
preferred spreading of the cells after 2 h is a result of a specific cell interaction 
with the osteocalcin. 



4. Conclusions and Open Questions 

The study has shown that the immobilisation of non- collageneous proteins on 
the surfaces of collagen-HAP composites opens the door for the activation of bone 
forming cells. The hypothesis of Hauschka concerning the specific interaction of 
osteocalcin with the growing HAP crystals has been experimentally proved. Thus 
a favourable nanocrystalline structure of HAP can be created. Additionally, the 
functionalisation of the material with osteocalcin improves the spreading of os- 
teoblasts in the early stage of interaction. From this investigation we hope to 
develop a future strategy for the remodeling of artificial bone implants based on 
collagen - HAP composites which follows a route shown in Fig. 6. 




ControLof crystal I>issolution- 

mideation and grower repredpitatitm kinetics 



Figure 6. Scheme of in vitro remodeling of artificial bone im- 
plants. In this approach specific non-collageneous proteins as os- 
teocalcin or osteopontin should be used to fulfil two tasks - tai- 
loring the microstructure of the mineral phase (HAP) as well as 
stimulating the cell adhesion. 

There are many open questions. We know that not only a chemical stimula- 
tion of the cells, but also a mechanical activation of the celluar activity is important 
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for the bone formation. It is of great interest which role the non-collageneous pro- 
teins play for the specific load transfer between the extracellular matrix and the 
cells. In the result of such a mechanical interaction the expression of other proteins 
with relevance for the scaffold remodeling would be possible. Therefore, in vitro 
studies of the scaffold remodeling in a cell reactor with the additional option of 
mechanical loading are of great interest. 
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1. Introduction 

Cell shape is influenced by physical surface topography of the substrate to which 
cells are attached [23, 8]. Most prominent is the polarization and elongation of cells 
along the direction of microfabricated grooves. Microstructured substrates are also 
known to affect intracellular processes such as apoptosis [4], gene expression [9] 
or protein production [6]. However, little is known about the mechanism of how 
surface topography exerts its effects, although several studies have focused on the 
role of cytoskeletal elements such £ls actin microfllaments [29], focal contacts [28], 
and microtubules [24] as all these structures are observed to align with topographic 
features such as grooves. Most of these experiments were performed with cultured 
flbroblcists where filament structures showed to align along microstructured lines. 
We examined cell shape alternations of human melanocyte cells cultured on micro- 
grooved polymeric substrates. These cells were derived from healthy persons and 
patients suffering on Neurofibromatosis 1 (NFl). NFl is caused by a germline mu- 
tation in the NFl -gene, a classical tumor suppressor gene. The majority of the NFl 
mutations are null mutations resulting in reduction of the related protein product, 
neurofibromin, to about 50% [16, 11]. Neurofibromin shows a Ras-GAP activity 
[14, 21] and is involved in regulation of growth in vitro [1, 20]. In addition, it is 
found to be associated with the cytoskeleton (Xu and Gutmann 1997) and NFl 
deficient cells exhibit morphological changes, as demonstrated for Nfl-/- Schwann 
cells [20] or cells from Drosophila homozygous for null mutations of an NFl ho- 
molog [27]. Neurofibromin reduction can also result in morphological changes in 
cultured human NFl keratinocytes in which it is co-localized with intermediate 
filaments [22]. We demonstrated that the cellular consequence of haploinsufficiency 




186 



R. Kemkemer, H. Gruler, D. Kaufmann and J.P. Spatz 



for the tumor suppressor gene NFl in melanocyte cells is an altered dendrite for- 
mation in vitro ([26]). This altered morphology of NFl cells could be related to 
an alteration of the signal-to-noise ratio for the regulation in two of three inves- 
tigated cell biological shape parameters [26]. In other words, in melanocytes the 
reduction of neurofibromin to a level of 50% is related to an increase of inaccu- 
racy in regulation of dendrite formation. In this paper we show that these cellular 
consequences of NFl haploinsufficency can be cancelled if cells are in contact with 
substrates that apply mechano-physical signals in terms of physical topography. 
Starting with investigation of morphological parameters of control and NFl cells 
grown on control and on microstructured substrates we quantify the experimental 
data of the cell morphology by a mathematical model. In this model the cell is 
considered as a self-organized automaton that adjusts to a characteristic shape 
parameter of the cells, the so-called set-point. The deterministic part of the self- 
organized automaton is altered by introducing noise to the adjustment process 
(inordinate cellular processes). This concept was already successfully applied to 
describe the orientation of fibroblasts [17] or keratinocytes [13] and self-organized 
alignment of interacting melanocytes in vitro [18, 19]. 



2. Methods 

2.1. Cell Types and Cultures 

The preparation of melanocytes from biopsies of skin of healthy donors and from 
skin of NFl patients was performed as described [12]. The germ line mutations in 
the patients investigated inactivate one NFl allele [11]. In NFl melanocytes one 
NFl wild type allele is present and the cells show a reduction of neurofibromin to 
approximately 50% [12, 10]. The melanocytes of two controls and two NFl patients 
were cultured in parallel. The patients were matched for age and gender, the cells 
for passage and density. 

2.2. Morphological Parameters 

Four days after starting the cell culture (seeding density 50/mm^) on the sub- 
strates 20 randomly taken pictures were digitized (450-600 cells per culture) by a 
Zeiss Axiovert 135 microscope equipped with a motorized stage and an attached 
Hamamatsu CCD camera (C5405). Based on the freely available NIH-IMACE 
(http://rsb.info.nih.gov/nih-image/) an image-processing algorithm was developed 
to determine the position of the cell body, the number of dendrites, the orienta- 
tion of the dendrites and their length. These data were stored in a database after 
normalization of the observed distributions: (1) The number of dendrites per cell 
was collected in a normalized histogram, /*(n) = N'(n)/A^o, with Nq total number 
of dendrites. (2) The angle, 0, between the dendrites of bipolar cells was mea- 
sured and normalized by the number of angles. This gives the angle distribution 
function /|(0). (3) The length. A, of a dendrite is collected in normalized length 
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distribution functions for two dendrites per cell: /|(A) Our theoretically derived 
distribution functions were fitted to the experimentally observed histograms. 

2.3. Preparation of Substrates with Grooved Topography 

Master substrates, silicon (lOO)-wafers (2 inch, Crystech, Berlin), were struc- 
tured by a combination of photolithographic techniques and Reactive Ion Etch- 
ing (RIE) resulting in 8 m wide grooves separated by 8 m having a depth of 
500 nm. Such master substrates were used as a mould for the pattern transfer 
to Poly(dimethylsiloxane) (PDMS) substrates [30]. PDMS (Sylgard 184 kit, Dow 
Corning) substrates were produced by casting a solution of the liquid PDMS on 
the master and curing it at 60C for at least 48 h. PDMS substrates were treated 
with oxygen plasma (50 Watt) for 5 s to increase cell adhesion. Control substrates 
were prepared by the same procedure using non-structured Si(100)-wafer. 

2.4. Mathematical Description of Self- Organization of NFl Melanocytes 

The mathematical description will model the cellular morphology of melanocytes 
as a self-organized cellular automaton as previously shown [26]. The melanocyte 
cell system is ideal for examining a model description as it offers well-defined cell 
shape parameters such as the number of dendrites, the angle between dendrites 
and the length of dendrites. Regarding the cell as an automaton, we have to define 
a so-called set-point of each of these parameters which is optimum for the cell 
automaton. For example, considering the number of dendrites we have to assume 
a set-point of the automaton, which is identical to the ideal number of dendrites 
of the melanocytes. The changes in the number of dendrites produced by the 
automaton can be described by the following rate increase: 

^ = -k*(n-no) (1) 

Since the automaton described by this equation reacts only when the actual 
number of dendrites (n) deviates from the set point (no), the speed (returning 
force) by which the cell returns to the set-point is determined by kn- However, 
according to this reaction all cells will end with the number of dendrites identical 
to the set-point, which obviously is not true for real cells. There is always some 
variation in the number of dendrites in melanocyte cell cultures. An additional 
term, Pn(^), has therefore to be introduced in the rate equation in order to account 
for the stochastic input introducing the variation into the number of dendrites 
observed. Recently, in a theoretical model stochastic processes were suggested to 
have effects in haploinsufficiency [5]. 

^ = -fc * (n - no) + r„(i) (2) 

at 

Since the angle between the dendrites and the length of the dendrites vary 
also, their behavior can be modeled in a similar way and described by similar 
equations: 



dt 



ks * sin{rii){Q - 0o)) + ^(-)(^) 



(3) 
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and 

^ = -kx*{X-Xo)+rx{t) (4) 

at 

Due to the stochastic disturbance that had to be introduced into the equa- 
tions, they cannot be used to predict the number of dendrites, their length or the 
angle between them. It is possible to predict for example the probability by which 
the angle between two dendrites falls between © and © -h A©. This probability 
is given by the stochastic angular distribution function /(©). Such a distribution 
function can in general be obtained by transforming the stochastic differential 
equations (Eq. 2, 3, or 4) into a partial differential equation if one assumes that 
the stochastic process in the signal chain can be approximated by white noise with 
a strength q (details see Risken 1984). The corresponding steady state distributions 
are: 

(a)/(n) = (6)/(0) = foe'^‘'^^;and{c)f(l) = (5) 

where /o is a constant, which is determined by the normalization 

I f{n)dn =1,J /(0)de = 1,J f(l)dl = 1. 

The predicted generating function V (n) for the number of dendrites is 



v{n) 



- — (n - U())^ 

Qn 



{n - np)’^ 



(6) 



The predicted angle distribution has to be compared with the measured one 
where no and are the fitting parameter. The whole signal transduction chain 
can be characterized by the signal- to- noise ratio, (5/A^)n- 
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The generating functions for the angle between the dendrites and the lengths 
and the corresponding signal-to-noise ratios are 
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3. Results 

3.1. Shape Parameters for Melanocytes on Unstructured Substrates 

Grown under the used culture conditions and on non-structured culture dishes 
melanocytes derived from skin samples of healthy persons show a typical bipo- 
lar shape with a cell body and two dendrites as shown in Fig. la in an optical 
micrograph. Only a few cells have three or more dendrites. 




Figure 1. Optical micrographs showing that the cell shape on 
non-structured substrates of (a) normal Melanocytes are usually 
bipolar and for (b) NFl melanocytes are more irregular having 
often more than two dendrites. Cultured on linearly grooved sub- 
strates, control (c) and NFl melanocytes (d) align their dendrites 
along the direction of the grooves and the shape becomes more 
regular. In the case of NFl melanocytes, cells having more than 
two dendrites are rarely being observed. The scanning electron 
microscope image (e) shows the alignment of a NFl melanocyte 
along the grooves and the fillopodia activity at the end of one of 
its dendrites (insert). 

Most NFl-melananocytes also show a typical bipolar shape. The maximum 
of the dendrite number distribution function of normal or NFl cells remains un- 
changed, which we call the set-point (no = 2) (Fig. 2). However, up to 42% of 
the NFl cells have three or more dendrites (Fig. lb and Fig 2). We define this 
increased uncertainty in the regulation of the number of dendrites per NFl cell as 
noise in the cellular automata. The width of the distribution corresponds to the 
stochastic part of the equation for the automaton describing the signal-to-noise 
ratio (S/N) (Eq. 7). The average S/N is 4.37 ± 0.27 (SEM) for the two control 
cell populations compared to 2.92 ± 0.42 for NFl cell cultures. 

Figure 3 shows a graph of the distribution of the angle between the two 
dendrites of control and NFl Melanocytes where only Melanocytes with two den- 
drites had been counted. The distribution of the angle between the two dendrites 
is broader for NFl cells (Fig. 3) and has the maximum at 180 representing the 
set-point. Again, noise in regulating the dendrite orientation is increased for NFl 
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Figure 2. The mean distribution funetions for the number of 
dendrites. The data points represent the observed numbers of den- 
drites (□ control on non-structured substrate; O control on lin- 
early structured substrate; V NFl melanocyte on non-structured 
substrate; o NFl melanocyte structured substrate). The lines rep- 
resent the fitted distribution function (eq. 5 a). All observed data 
could be simulated well by the equations presented using the ob- 
served set-point and variation. The set-point, i.e. the maximum of 
the distribution, is identical in all samples. However, the width of 
the distributions differs for cells on non-structured and structured 
substrates. 

cells. The signal-to-noise ratio is 2.61 ib 0.19 for control and 1.98 ±0.17 for NFl 
cells. 

The distributions for the length of the dendrites are given in Fig. 4. The 
signal-to-noise ratio for controls is 3.1 ± 0.6 (controls) and 2.2 ± 0.44 (for NFl 
cells). Differences with respect to the set-point are not significant as shown previ- 
ously (Schrank-Kaufmann et al. 2000). 



3.2. Shape Parameters for Melanocytes on Micro- Structured Substrates 

Figure Ic and Id shows optical micrographs of normal and NFl melanocyte cells 
on grooved substrates. The morphology and dendrites are highly oriented along the 
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Figure 3. The distribution function for the angle 0 between 
the two dendrites is given for control and NFl melanocytes with 
two dendrites (□ control on non-structured substrate; O control 
on structured substrate; V NFl melanocyte on non-structured 
substrate; o NFl structured substrate). The data for the angle 
distribution were fitted by the predicted distribution function (Eq. 
5 b) in a logarithmic plot. Controls and NFl cells show a set-point 
of 180 degree regardless on which substrate they grow. The width 
of the distribution for cells on non-structured substrates chances 
to a small width if cells are cultured on the grooved substrates, 
corresponding to an increased signal-to-noise ratio. 



direction of the grooves. As shown in Fig. 2, the distribution width of the number 
of dendrites of normal and NFl melanocytes decreases if cells are cultured on linear 
micro-structured substrates. This is especially the case for NFl melanocytes. The 
signal-to-noise ratio raises from 2.92 ± 0.42 to 5.05 ± 0.12 for NFl cells on the 
grooved substrate. For control S/N is 4.96 ± 0.18 compared to 4.37 ± 0.27 on the 
unstructured control substrate. The set-point remains unchanged for control and 
NFl cells (no = 2). 
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Figure 4. Distributions for the length of dendrites of one con- 
trol (□) and one NFl melanocyte (V) culture grown on a non- 
structured substrate, respectively on the structured substrate (O 
control, o NFl). Data points were fitted by Eq. (5c). The set- 
point (maximum) as well as the noise (width) of the distribution 
increase if the cells are cultured on linearly grooved substrate. 
The cells become more irregular with respect to the dendrite 
length. 



Figure le shows a scanning electron micrograph of a normal melanocyte cell 
on a micro-grooved substrate. Obviously, the dendrites are not running in the 
grooves but follow the grooves on top of the micro-pattern. The set-point for 
the dendrite orientation is not altered (©o = 180). However, the mean deviation 
between the set-point and the actual angle when determined as S/N is increased 
for the cells on structured substrates (Fig. 3). The values from the fit of the model 
(eq. 8) are S/N=3.78 ± 0.22 for controls (2.61 ib 0.19 on non-structured substrate) 
and 3.39 ib 0.19 for NFl melanocytes (1.98 ± 0.17 on non-structured substrate). 

In contrast, the length distribution observed for control and NFl melanocytes 
exhibits an increased set-point (maximum of the distribution) as well as a de- 
creased signal-to-noise ratio (Fig. 4). We find an increase of the set-point from 
23.4 m to 27.1 m for control and from 32.4 m to 38.3 m for NFl melanocytes if the 
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cells are cultured on the grooved substrate. The signal-to-noise ratio is reduced 
from 3.1 ± 0.6 to 2.7 ib 0.32 (control) respectively from 2.2 ± 0.44 to 1.7 ib 0.21 
(NFl). 




Figure 5. Distributions for the anisotropy A = (li - h)/ {I \ + 

I 2 ) of the lengths of dendrites for NFl cells with two dendrites. 

Cells on non-structured substrates (□) have almost equally long 
dendrites (A=0), whereas the anisotropy increases if the cells are 
cultured on a linearly grooved substrates (O). Each set of data 
was fitted by two Gauss distributions (lines). 

Remarkably, the anisotropy of the dendrite length, defined as A := (/i — 
h)/{h + h), for NFl cells having two dendrites of length l\ and I 2 changes if the 
cells are cultured on the grooved substrates. On the non-structured substrates the 
dendrites are almost equally long (A 0) as the difference in dendrite length of 
the two dendrite increased on the grooved substrates (A 1) (Fig. 5). 

3.3. Discussion 

The response of cells to topography of substratum to which they are attached was 
one of the first phenomena observed in tissue culture (Harrison 1914). Subsequent 
studies have shown that surface topography is an important factor in controlling 
the shape, orientation and adhesion of mammalian cells [2, 7, 4, 3]. 
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We investigated the cellular response of NFl haploinsufficient human mela- 
nocytes to a physical- mechanical signal, a micro-grooved surface topography, and 
quantified their morphological response by a model considering the cell as a cellu- 
lar automaton regulating its shape. On one hand, the effects of NFl reduction to 
a level of about 50% can result in an altered morphology of melanocytes, which 
can easily be measured in single cells by determining the number of dendrites or 
the orientation of dendrites. This is essential for applying a quantitative model. 
Moreover, the cells provide a well-characterized system for testing the effects of 
the reduction of a single protein (neurofibromin) to a level of approximately 50%. 
On the other hand, one function of neurofibromin may be the regulation of the cy- 
toskeleton [22] although details of the relevant signal transduction and intracellular 
regulation are still unknown. 

Our system analysis provides insight into the nature of the response mechanism 
to the neurofibromin reduction without knowing the molecular steps involved. We 
do quantify the differences in terms of a deterministic and stochastic part of the 
regulation [26]. 

Here, we demonstrated that the apparent differences in morphology between con- 
trol and NFl melanocytes is canceled if cells are cultured on micro-structured sub- 
strates. Growing on these grooved substrates, NFl haploinsufficient melanocytes 
develop a more regular shape in respect to two (number and orientation of den- 
drites) of the three measured shape parameters. This corresponds to an increased 
signal-to-noise ratio in our cell model. Moreover, with respect to the number (Fig. 
2) and orientation (Fig. 3) of dendrites NFl haploinsufficient melanocytes behave 
like normal melanocytes through contact with the micro-grooved substrate. In con- 
trast, the grooved substrates caused an increased stochastic strength and therefore 
decreased signal-to-noise ratio in the third observed parameter, the length of den- 
drites (Fig. 4). In respect to this parameter, control and NFl cells become more 
irregular and asymmetric. Obviously, this is caused by the guidance cue given to 
the dendrite through the micro-structures and might be explained by pure me- 
chanical effects. Melanocytes show filliopodia activity at the end of its dendrites 
as it can be partly seen in the scanning electron microscopy image in Fig. le. It 
might be possible that the mechanical stabilization of dendrites stimulates a higher 
rate of dendrite length regulation. The anisotropic change in dendrite length can 
be observed on a regular time scale and can be correlated to nucleus oscillations 
as it will be shown in a following article. 

In conclusion, we have observed that cell shape parameters, as the number of 
dendrites and the orientation of dendrites, are hardly distinguishable between nor- 
mal and NFl melanocytes if cultured on micro- grooved substrates. The question 
arises whether the morphological changes are related to changes in the intracellular 
regulation or just by external forces that do not affect the intracellular function. 
Normal and NFl melanocytes can serve as a model system for following the ef- 
fects of shape development on a molecular level. Moreover, with their defined gene 
defect, NFl melanocytes allow a biophysical approach to description of haploin- 
sufficiency on the level of cell phenotypes. 
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1. Introduction 

Endothelial cells cover the inner surface of the cardiovascular system. Thus, they 
are permanently exposed to fluid shear stress generated by blood flow. Building an 
interface between blood and surrounding tissue, endothelial cells are involved in a 
variety of regulation mechanisms (as blood pressure), transport processes, or the 
maintenance of the barrier function [6]. Most of these mechanisms are influenced 
by shear stress. Endothelial cells in vivo and in culture respond to shear stress 
with the activation of intracellular signal pathways, release of speciflc mediators, 
transcriptional activation, and gene expression. These microscopic processes Anally 
result in a morphological change of cell shape from a polygonal to an elongated 
phenotype [1, 7]. 

Cell alignment and shape change as response to shear stress are considered to 
minimize the local mechanical load and thus to protect cells from hydrodynamic 
injury. On the other hand, it is known, that cells try to maintain their barrier 
function, thus all morphodynamical changes are constrained by trying to preserve 
the monolayer integrity. Consequently, morphological adaptation to shear stress 
requires a coordinated temporal and spatial reorganisation of cell components, as 
the cytoskeleton, and a differentiated regulation of cell-to-substrate and cell-to-cell 
contacts. 

Here we give a short survey of our techniques and current knowledge about 
the response of endothelial cells to shear stress (for details see [3, 6]). In addition, 
the dynamics of cell changes is related to the transendothelial resistance (TER). 
Finally, the results are considered to start the development of a mathematical 
(biophysical) model of a cell monolayer under the influence of fluid shear stress. 



2. Techniques 

In order to study cultured endothelial cells under homogeneous and well defined 
flow conditions, we have developed a cone-plate rheometer system [3, 4]. Cells are 
cultured on the plate where a rotating cone placed above the cell layer produces 
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fluid shear stress which is nearly independent of the radial distance [2] and laminar 
in the range of 0-200 dynjcw? [4]. 

Furthermore, the system is transparent, allowing the on-line observation of 
endothelial cells by phase contrast microscopy during shear stress exposure. In 
addition, several cone-plate chambers are mounted on a revolving plate, allowing 
investigations of manipulated and control cells of the same passage at different 
levels of shear stress within the same period of time [3]. The processing control of 
the system is automated, allowing the observation of cells at selected positions in 
different chambers during a predefined time-course. The angular velocity (i.e. the 
shear stress level) of each head is controlled independently. 

The software also accomplishes data acquisition and image segmentation and 
analysis. The resulting video sequences are investigated to extract morphodynam- 
ical parameters. The processing of each image captures individual cells generating 
outlines consisting of the cell nucleus and the perinuclear organelles [3]. The out- 
lines deliver the position, area, perimeter, orientation, and shape index of each 
cell. Furthermore an algorithm links individual positions of two sequential time 
steps to form trajectories of individual cells as a function of time. A typical result 
for pulmonary trunk endothelial cells (PSEC, for cell culture and medium, see [5]) 
is presented in Fig. lA. Smoothed trajectories are the result of a decomposition 
of motion into locomotion and fluctuations (for details see [3]). 

To determine the TER of the confluent cell layer, thin gold electrodes were 
evaporated at the glass slides (for details see [6]). The impedance magnitude \Z\ 
was determined as a function of frequency between 10 Hz and 1 MHz. Each 
impedance spectrum was analysed by fitting the parameters of an equivalent circuit 
to the experimental data [6]. The equivalent model delivers electrode parameters, 
the Ohmic resistance of the medium Rhuiki the capacitance of the plasma mem- 
branes C, and the transendothelial resistance TER [8]. TER characterises the 
state of cell-to-cell junctions, i.e. the endothelial barrier function. 



3. Quantification of The Adaptation Process 

The presented rheometer system is used to quantify the dynamical response of an 
endothelial PSEC cell monolayer to fluid shear stress with several morphological 
parameters, such as cell position, orientation, and elongation. 



3.1. Cell Trajectories and Heterogeneity 

Figure lA provides an overview of the detected cell trajectories as an overlay with 
the original images taken at 20 min before (top) and 60 min after (bottom) the 
onset of shear stress (starting at t = 0 min with strength r = 50 dyn/cm?). A 
preferred locomotion of the cells parallel to the direction of flow is obvious, but a 
heterogeneity of individual cells can also be observed. 
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Figure 1. Individual trajectories of cells. A: An overlay of 
the starting (top: t = -20 min) and the final image (bottom: 
t = 60 min) is shown. The onset of shear stress exposure starts at 
t = 0 min, its direction 0 is marked as arrow (r = 50 dyn/cm^). 
B: Five typical cell trajectories are displayed at higher resolution. 
Time resolution of the experiment was 1 min. Small points de- 
note the original measurements of the cell centres, crosses (r = 0, 
t < 0) and circles (r = 50 dyn/crm?, t > 0) refer to smoothed 
trajectories (generated according to [3]). 
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To get more detailed information on this heterogeneity we have selected five 
individual cells (Fig. lA/B) which are representative for the cell ensemble. Obvi- 
ously there are some cells (such as No. 51 and 148) responding with highly per- 
sistent locomotion in the direction of shear stress. By contrast, other cells (such 
as No. 48 and 79) perform a less persistent motion with relatively high lateral 
motility but still with preference in the fiow direction. We propose the hypothesis 
that the observed heterogeneity between individual cells of a confiuent culture is 
due to a sub-cellular regulation of cell-to-cell contacts producing spatially varying 
forces on the cells. 

Generally we claim that under resting conditions cell path directions are 
randomly distributed, whereas under shear stress (50 dynlcm^) the trajectories 
tend to be parallel to the direction of shear fiow (see Fig. IB). Moreover, even 
within this fiow-oriented trend, cells more or less frequently change their direction 
and even move transversally to the fiow direction. 

3.2. Analysis of Motility 

Cells within a confiuent monolayer are coupled via cell-cell contacts to each other 
and via basal junctions to the substrate. Thus, cell motion requires a coordinated 
regulation of cell adhesions and releases of couplings combined with a reorganisa- 
tion of the cytoskeleton, in particular of the actin filament system which provides 
the driving forces for cell motion. 

As we have previously shown [3], cell motion can be decomposed in the sum of 
directed locomotion over the substrate (absolute mean value of locomotion velocity 
v{t) and direction ^{t) of locomotion) and spatial fiuctuations ^{t) describing a 
zigzag movement of cells around their mean paths (compare fine points in Fig. IB). 
The decomposition is performed with a sliding Gaussian filter (for details see [3]). 

The behaviour of the distribution of locomotion velocities for the cell en- 
semble is displayed in Fig. 2A/B. Shear stress increased (after a short delay) the 
locomotion velocity of the cell ensemble (Fig. 2A). In addition, cells mainly moved 
into the direction of fiow ©. The effects were reproducible within the same exper- 
iment, whereas changes were less pronounced for locomotion velocities (Fig. 2A). 
This indicates, that the initial response might be more expressed, and that adap- 
tation effects occur on longer time scales. Furthermore, Fig. 2C shows a reduction 
of spatial fiuctuations of cells after the onset of shear stress. Within less than 10 
min fiuctuations were reduced to about 2/3 of the initial value. 

The reduction of fiuctuations is additionally related to the time course of 
TER. As shown in Fig. 2D the step-wise increase of shear stress with intermediate 
resting phases (with a duration of 24 min) triggers a transient increase of TER 
indicating stronger cell interactions during this time interval (with response times 
depending on the magnitude of the applied shear stress). Fig. 2D (bottom) also 
shows, that this is correlated with the reduction of spatial fluctuations Thus 
cell-interactions seem to play a key role in the dynamical response to the onset of 
shear stress. 
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Figure 2. Cell motility and TER. A: Distribution of locomotion 
velocities v for the cell ensemble is shown as grey values. The onset 
of shear stress triggers an increased locomotion. B: Values of the 
directional angles 'ip coded according to the grey value map. Cells 
prefer the direction 0 of flow. C: The increase of shear stress 
reduces cell fluctuations within some minutes. D: Shear stress is 
varied and increased during the experiment as indicated. TER 
follows the onset of shear stress. Also spatial fluctuations ^ are 
reduced due to shear stress. 




204 



P. Dieterich, J. Seebach and H.-J. Schnittler 





Figure 3. Morphological adaptation to fluid shear stress. 
A: Standard deviation a(t) of individual orientations for the cell 
ensemble with respect to the flow direction. Orientation started 
in phase III with a time-delay of about 15 min after the onset of 
shear stress. B: The mean structure index SI maintained a con- 
stant value for a long time and started to decrease after about 
2.5 h (phase IV). 



3.3. Phase-Like Adaptation to Shear Stress 

Besides changes in migration activity, shear stress causes modiflcations of orienta- 
tion and shape of cells, which is a key aspect of the adaptation process. We have 
defined o-(t) as the standard deviation of the orientation distribution of the cell 
ensemble with respect to the direction of flow. A value of about 52^ corresponds 
to a randomly oriented system, lower values indicate alignment. In addition, the 
mean structure index SI =< > (as ratio of area A and squared perimeter 

P) captures the elongation of the cell shape. A value of 1 corresponds to a circular 
object whereas smaller values indicate elongated shapes. 






Shear Stress-Induced Cell Migration 



205 



The beginning of change of these parameters can be used to define temporal 
phases. As shown in Fig. 3, in phaise I, under resting conditions, all mean pa- 
rameters remain constant apart from small variations around their mean values. 
After the onset of shear stress cell motility changes markedly (phase II, see Fig. 2), 
whereas the mean orientation remains constant over a period of time and seems 
to depend on the level of shear stress. Phase III begins with a clear alignment 
characterized by a{t) (Fig. 3A). Commonly, the change of the mean structure 
index also shows a delay, where SI is maintained in the order of hours. SI de- 
creased significantly 150 minutes after the onset of shear stress marking phase IV 
(Fig. 3B). Although cell cultures of different isolates showed heterogeneity, in the 
time-related response all experiments showed a sequential time course composed 
in the sequence of change in migration activity, alignment, and elongation. 



4. Toward a Mathematical Model 

The presented work shows a complex and differentiated dynamical behaviour of 
confluent cell cultures. We have found different collective phases and spatial and 
temporal variations of individual morphodynamic cell properties. Both are signa- 
tures of highly correlated, instationary dynamical systems where cell-to-cell inter- 
actions seem to play a key role. 

Our model to capture these phenomena is schematically shown in Fig. 4. 
We suppose an ensemble of N interacting cells. Cells are put at (.x,y) positions 
coupled via a friction constant to a 2D substrate. The boundaries to neighbour 
cells are obtained via Voronoi construction. The calculation of inertia moments 
over each diagram is used to construct an ellipse corresponding to each cell (see 
e.g. [9]). These ellipses model the spatial elevation of endothelial cells around their 
centre consisting of nucleus and the nearby organelles [3]. This region is supposed 
to receive the mechanical forces of fluid shear stress trying to align this structure 
and even to push it forward on the substrate. 

From a theoretical point of view, nearly arbitrary forces can be implemented 
into the model. However, experimental observations deliver constraints on the 
choice of appropriate types of interactions. The forces are typically transmit- 
ted via the filament structure of the cytoskeleton, which can be approximated 
by a tension-stretching approach. These forces are coupled to each other at the 
cell boundaries via contact proteins. In this context, biochemical analyses of our 
group have recently shown, that tyrosine phosphorylation of certain junction pro- 
teins are correlated with the onset of shear stress which can be used to construct 
experimentally-based interactions. 

However, even in its simplest form, the model shows, that the topology of the 
cell distribution on the substrate leads to a heterogeneous response of cells to shear 
stress. As shown in Fig. 4 shear stress is trying to rotate cells into the direction 
of flow, where rotation of cells clock or counter-clockwise results in forces at the 
cell boundaries, which compensate (at position 1 and 2) or enhance (at position 
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Figure 4. Schematic representation of the model together with 
the qualitative action of shear stress (large arrow). Cell centres 
are denoted as small filled circles and lines represent the corre- 
sponding Voronoi boundaries. Ellipses are calculated from inertia 
moments of the Voronoi areas. Shear stress tries to align cells 
parallel to the direction of flow. However, due to the topological 
positions of cells, the forces at the cell boundaries between two 
cells can point into the same (3) or different direction (1,2). 



3) each other. Thus, already spatial locations of cells can cause a heterogeneous 
response to shear stress within the model. 

We are currently exploring the model with different cell-cell-interactions in a 
close relation to experiments in order to gain further insight into the regulation of 
cell contacts in endothelial cells. 
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Biological cells demonstrate a high degree of internal functional organiza- 
tion and coordination of processes at a molecular level. Almost ten years ago we 
asked [1] the question of whether self-organization in the cells represents only 
a reduced copy of spatiotemporal self-organization phenomena in macroscopic 
reaction-diffusion systems. We noticed then that the length scales of spatial pat- 
terns resulting purely from interplay between reactions and diffusion are limited 
by the diffusion length which is just the distance passed by diffusively moving 
molecules between two reactive collisions. For typical biochemical reactions, the 
diffusion length is of the order of a micrometer, comparable to the total size of a 
single cell and more than the size of cellular compartments. Our conclusion was 
that spatiotemporal pattern formation based on reactions and diffusion does not 
play an important role in the cells. On the other hand, we found [1] that any two 
molecules inside a cell of a micrometer size should meet each other, as a result of 
the diffusive motion, every second! This implies the existence of extremely efficient 
communication between individual molecules inside a biological cell. 

The functioning of a cell is based on a highly coordinated action of a large 
population of molecular machines. These machines, representing single protein 
molecules or their complexes, are far from thermal equilibrium because they receive 
energy in the chemical form. This allows them to operate autonomously, overcom- 
ing the restrictions set by thermodynamics for equilibrium systems. Active protein 
machines are immersed into a water solution providing a passive medium needed 
for the supply of energy and for the communication between the machines. The 
communication is realized through diffusion of small regulatory molecules released 
by a machine and able to afffect the operation of another machine. The system of 
communicating protein machines builds up a dynamical network. 

Such molecular networks are capable of complex collective dynamics, where 
the internal states of individual machines become synchronized and groups of 
coherently operating machines spontaneously develop. A review of our research 
dealing with self-organization in molecular networks of a cell can be found in the 
article [2], based on an invited talk at the Biological Physics Conference in Kyoto 
in 2001. A detailed discussion of mathematical models of complex coherent action 
is given in the recent book [3]. 
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This section includes six contributions dealing with various aspects of com- 
plex network dynamics. K. Kaneko reviews an approach to molecular and cell 
biology from the viewpoint of complex dynamical systems. The contribution by 
H.-Ph. Lerch, P. Stange and B. Hess discusses synchronization and dynamical clus- 
tering in networks formed by single enzyme molecules. Statistical properties of gene 
regulatory networks are considered in the article by Th. Rohlf and S. Bornholdt. 
The contribution by M. Ipsen is devoted to a general problem of reconstructing 
a network structure from its known dynamics. The remaing last two articles deal, 
respectively, with modeling of simple biochemical networks (R. Maithreye and 
S. Sinha) and information aspects of regulatory gene networks (J. T. Kim, Th. 
Martinetz and D. Polani). 
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1. Introduction 

Life system is both complex and complicated [12]. These two features have to be 
distinguished. In a complicated system, many factors are involved and the possi- 
ble variety of combination of such factors is enormous. It requires hard efforts to 
disintegrate it into parts, but in principle it is possible. Indeed in the bioinformat- 
ics, studies along this direction are carried forward with the aid of computer. On 
the other hand, in “complex systems”, one has to face the circular relationship 
in which each part is understood only through the relationship with the whole, 
although the whole, of course, consists of parts. In the complex systems approach, 
we try to understand universal logic that biological systems have to obey through 
such dynamic circulation between the whole and parts. Here we are not concerned 
with determining a specific role of a molecule or a biochemical process; Rather 
we intend to understand some universal features that a class of biological systems 
exhibit, irrespectively of details. 

In the present paper, we briefly sketch the study of this complex systems 
approach to biology both theoretical and experimental mainly focusing on 
the studies at a cellular level. For the study of complex systems, we take two 
standpoints, i.e., constructive biology and coupled dynamical system with internal 
degrees of freedom. 

Constructive approach: Since we are interested in the universal feature of 
a biological system, we need to study some features that are not influenced by 
the details of complicated biological processes. The present organisms, however, 
include detailed elaborated processes that are captured through the history of evo- 
lution. Then, for our purpose, it is desirable to set up a minimal biological system, 
to understand universal logic that organisms necessarily should obey. Hence, the 
approach that should be taken will be “constructive” in nature. This construc- 
tive approach is carried out both experimentally and theoretically. The problems 
discussed here are summarized in the table. 

Some key concepts obtained from coupled dynamical systems 
Experimental strategy to understand general questions (to be given later) is con- 
struction of prototype system for each problem, while as a theoretical strategy we 
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adopt dynamical systems theory. We adopt intra-inter dynamics model where each 
unit (say cell or an individual organism) with internal dynamics (say biochemical 
reaction dynamics) interact with each other. A characteristic feature common to 
biological systems is that these units can replicate. Hence the present dynamical 
systems change their degrees of freedom through temporal evolution. Through ex- 
tensive studies of this class of models, we have found common features in this class 
of models; 

(i) isologous diversification — general tendency of differentiation from identi- 
cal units [9, 18, 19], 

(ii) chaotic itinerancy — emergence of long-term dynamics over several quasi- 
steady states with some self-organized transition rules [8,9,16,23], 

(iii) minority control — tendency that replicators with a smaller size in popu- 
lation control the behavior of the total system [20] 

These concepts are relevant to understand common nature in biological sys- 
tems, while the problems in the above table are discussed by these concepts. It 
should be noted that instead of discussing fitness in replication, we discus recur- 
siveness that emerges from rather plastic replication process. 

In the following sections, we discuss each item in the Table 1, both from 
constructive and dynamical systems viewpoints. At each section we raise a basic 
question for each item, and propose an answer based on dynamical systems theory. 
For the first three items in Table 1, we explain briefly a theoretical model, which 
leads to the proposed answer, while some constructive experiments are briefly 
described. For the last two items, we describe only the question and outline of the 
answer, leaving the description of model studies to original papers. 

Table 1 



Construction of 


Experiment 


Theory 


Question 


multicellular 

system 


interaction- 
induced cell 
differentiation 


isologous diversi- 
fication in inter- 
intra dynamics 


robustness in 
development 


developmental 
process (I) 


controlled 
differentiation 
from stem cell 


emergence of 
differentiation 
rule by chaos 


irreversi- 

bility 


developmental 
process (II) 


artificial 
construction of 
tissues 


self-consistency 
between pattern 
and dynamics 


origin of 
positional 
information 


replicating 

system 


in-vitro 

replicating 

system 


minority 

control 


origin of 
information 


cell system 


dividing liposme 
with internal 
reaction 


dynamic bottleneck 
in autocatalytic 
reaction system 


evolvability 
with recursive 
production 
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2. Cell DifFerentiation 

Question: How does an identical cell diversify into a discrete set of cell types 
through development? Together with this diversification, how are several distinct 
cell types formed that maintain recursive production? How is a character of each 
cell type stabilized under molecular fiuctuations, and how is the number distribu- 
tion of cell types stably maintained? 

Logic 

To answer the question, “isologous diversification theory” is proposed. The 
theory explains how amplification of noise-induced slight difference between cells 
leads to a noise-tolerant society with differentiated cell types. Through the inter- 
action among cells with internal chemical dynamics, a state with homogeneous 
cells is destabilized. Cells diversify into discrete types spontaneously [18,19]. 

This diversification of cell types from a single cell type is shown to be a general 
consequence of interacting cells with biochemical networks and cell divisions, as is 
confirmed by several model simulations. According to the theory, differentiation 
proceeds first by loss of synchrony of intracellular oscillations as the number of 
cells increases. Then the chemical composition of the cells is differentiated. The 
differentiated compositions become inherited by the next generation, and lead to 
determined cell types. As a result of successive occurrence of the cell differentiation, 
the cell society will be composed of different cell types. The whole developmental 
process is shown to be stable against molecular and other external fluctuations, 
where amplification of noise-induced slight difference between cells leads to a noise- 
tolerant society with differentiated cell types. This robustness is a remarkable 
feature of isologous diversification, in contrast to the conventional threshold-type 
mechanism for development. 

Model 

We have studied several models [2,17 19] choosing (a) the internal variables 
and their dynamics, (b) interaction type, and (c) the rule to change the degrees of 
freedom (e.g., cell division). 

As for the internal dynamics, auto-catalytic reaction among chemicals is cho- 
sen, which often leads to nonlinear oscillation in chemical concentrations. As the 
interaction mechanism, diffusion of chemicals between a cell and its surroundings 
is chosen. 

To be specific, we mainly consider the following model here. First, the state 
of a cell i is assumed to be characterized by the cell volume and a set of functions 
representing the concentrations of k chemicals denoted by m = 1, • • • ,k. 
For the internal chemical reaction dynamics, we choose a catalytic network among 
the k chemicals. The network is defined by a collection of triplets {£J,m) repre- 
senting the reaction from chemical m to ^ catalyzed by j. The rate of increase of 
xl{t) (and decrease of through this reaction is given by x[^^\t){x[^\t))^ , 

where a is the degree of catalyzation Each chemical has several paths to other 
chemicals, and thus a complex reaction network is formed. These reactions can 
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include genetic processes, where xl(t) is regarded a degree of expression of a 
given gene. 

Cells interact with each other through the transport of chemicals out of and 
into the surrounding medium. As a minimal case, we consider only indirect cell-cell 
interactions through diffusion of chemicals via the medium. The transport rate of 
chemicals into a cell is proportional to the difference in chemical concentrations 
between the inside and the outside of the cell, and is given by — xf’(O), 

where denotes the diffusion constant, and is the concentration of the 

chemical at the medium. With this type of interaction, corresponding chemicals in 
the medium are consumed. To maintain the growth of the organism, the system 
is considered to be immersed in a bath of chemicals through which (nutritive) 
chemicals are supplied to the cells. 

As chemicals flow out of and into the environment, the cell volume changes. 
The volume is assumed to be proportional to the sum of the quantities of chemicals 
in the cell, and thus is a dynamical variable. Accordingly, chemicals are diluted as 
a result of the increase of the cell volume. 

In general, a cell divides according to its internal state, for example, as some 
products, such as DNA, are formed, accompanied by an increase in cell volume. 
Again, considering only a simple situation, we assume that a cell divides into two 
when the cell volume becomes double the original. At each division, all chemicals 
are almost equally divided, with random fluctuations. 

Of course, there can be a variety of choices on the chemical reaction network. 
However, the observed differentiation process with the increase of the cell number 
does not depend on the details of the choice, as long as the network allows for the 
oscillatory intra-cellular dynamics leading to the growth in the number of cells. 

Results from the Model 

Up to a certain number of cells (which depends on the model parameters), 
dividing cells from a single cell have the same characteristics. When the number 
of cells rises above a certain (threshold) value, the state with identical cells is no 
longer stable. Small differences introduced by the fluctuation start to be amplified, 
until the synchrony of the oscillations is broken. Then the cells split into a few 
groups, each having a different oscillation phase (see Fig. la). The cells belonging 
to each group are identical in phase. This diversification in the phases, however, 
cannot be called cell differentiation, because the time average of the biochemical 
concentrations reveals that the cells are almost identical. The change of phases at 
the second stage is nothing but dynamic clustering studied in coupled nonlinear 
oscillators [9,10]. 

With the further increase of the cell number, the average concentrations of the 
biochemicals over the cell cycle become different. The composition of biochemicals 
become different for each group. The orbits of chemical dynamics plotted in the 
phcise space of biochemical concentrations, lie in a distinct region within the phase 
space (see Fig. lb). 




Constructive and Dynamical Systems Approach to Life 



217 



Hence distinct groups of cells are formed with different chemical characters. 
Each group is regarded as a different cell type, and the process to form such types 
is called differentiation. With the nonlinear nature of the reaction network, the 
difference in chemical composition between the clusters is amplified. By the for- 
mation of groups of different chemical compositions, each intra-cellular biochemical 
dynamics is again stabilized. 

After the formation of cell types, the chemical compositions of each group 
are inherited by their daughter cells. In other words, chemical compositions of cells 
are recursive over divisions. The biochemical properties of a cell are inherited by 
its progeny, fixed or determined over the generations. After several divisions, such 
initial condition of units is chosen to give the next generation of the same type as 
its mother cell. 

Experiment 

The above theory can be generally applied to developmental process in the 
present multi-cellular organisms. However, the present organisms adopt further 
detailed mechanisms in addition to the above process, to stabilize it. Then, it is 
important to study experimentally a multicellular system as simple as possible. As 
this constructive experiment, Yomo’s group, [21], by using E. coli cells, found that 
in a highly dense culture (with strong interaction), the E. coll cells spontaneously 
differentiate into a few groups with different enzyme activities, although they have 
identical genes. 



3. Differentiation from Stem Cell 

Question: In development of multi-cellular organisms, there exists a cell, called the 
stem cell, that either proliferates or differentiates to several other cell types. There 
exists a rule for such differentiation from multipotent stem cells. How are such 
differentiation rules generated? How can several cell types coexist stably under 
fluctuations? 

Logic 

We discuss this problem by extending the isologous diversification to include 
chaotic dynamics. Two fundamental features of stem cell systems - stochastic dif- 
ferentiation of stem cells and the robustness of a system due to regulation of this 
differentiation - are found to be general properties of a system of interacting cells 
exhibiting chaotic itinerancy for intra-cellular reaction dynamics. 

Results from Model Simulations 

The most interesting example in the model discussed in §2 is the formation 
of stem cells. In an example given in Fig.lc [2], a cell type, denoted as “S”, either 
reproduces the same type or forms different cell types, denoted for example as type 
A and type B (i.e., S — > S,A,B). iType B cell replicates as B B), while the 
type A cell either eplicates or differentiates as A ^ A, A1,A2. The type A1 and 
A2 cells are determined (i.e., A1 ^ Al, A2 A2). This hierarchical organization 
is often observed when the internal dynamics have some complexity, such as chaos. 
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Fig.i b) 



Fig.lc) 

Figure 1. Schematic representation of cell differentiation pro- 
cess, plotted in the phase space of chemical concentrations. 

It is found that stem cells differentiate into other cell types stochastically due to 
instability caused by cell-cell interactions. 

The differentiation here is “stochastic”, arising from chaotic intra-cellular 
chemical dynamics. The choice for a stem cell either to replicate or to differentiate 
looks like stochastic as far as the cell type is concerned. Since such stochasticity is 
not due to external fluctuation but is a result of the internal state, the probabil- 
ity of differentiation can be regulated by the intra-cellular state. This stochastic 
branching is accompanied by a regulative mechanism, as will be discussed in the 
next section. 
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4. Stability and Irreversibility in Development of a Cell Society 

Question: In development of multi-cellular organisms, there is stability both at 
microscopic and macroscopic levels. The developmental process is stable against 
molecular fluctuations, and also against macroscopic damage such as elimination 
of some cells. How does stability emerge in a multi-cellular system? Also, there 
is a successive determination from totipotent ES cell, to multipotent stem cells, 
and Anally to several determined cell types. There exists a clear timeline in the 
direction of the loss of potency to form a variety of cell types, in the normal devel- 
opment course. How is such irreversibility generated and how is it characterized 
quantitatively? Is such irreversibility related with developmental stability? 

Logic 

Using the model studies in §3 and 4, and also by examining several other 
models, it is shown that the cell types and their number distribution are robust 
against perturbations [3,6,15] Furthermore, it is shown that the irreversible loss of 
multipotency accompanying the change from a stem cell to a differentiated cell is 
a general course of interacting cell systems, and is characterized by a decrease in 
the intracellular chemical diversity and in the complexity of the cellular dynamics. 

Results from Model Simulations 

First, intra-cellular dynamics of each cell type in our models are stable against 
such perturbations. Then, one might think that this selection of each cell type is 
nothing more than a choice among basins of attraction for a multiple attractor 
system. If the interaction were neglected, a different type of dynamics would be 
interpreted as a different attractor. In our case, this is not true, and cell-cell inter- 
actions are necessary to stabilize cell types. Given cell-to-cell interactions, the cell 
state is stable against perturbations on the level of each intra-cellular dynamics. 
The concept of “partial attractor” is proposed in [13], to explain this situation. 

Second, each cellular state is also stable against perturbations on the cell 
distribution. The macroscopic stability is clearly shown in the spontaneous reg- 
ulation of differentiation ratio in the last section. Depending on the distribution 
of the other cell types, the orbit of internal cell state is slightly deformed. For a 
stem cell case, the rate of the differentiation or the replication (e.g., the rate to 
select an arrow among S 5, A, B) depends on the cell-type distribution. For 
example, when the number of “A” type cells is reduced, the orbit of an “S-”type 
cell is shifted towards the orbits of “A”, with which the rate of switch to “A” is 
enhanced [2]. 

If each cellular state were an attractor of internal chemical dynamics, the 
microscopic stability of a cellular state would be achieved. In this case, however, 
neither the macroscopic stability of a cellular state nor the stability of develop- 
mental process (how each cell type appears in the time course) is explained. Thus 
the cell-cell interaction is important. On the other hand, if the cell type number 
distribution (accordingly the interaction) is changed up to some degree, each intra- 
cellular dynamics keep its type. Hence, discrete, stable types are formed through 
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the interplay between intra-cellular dynamics and interaction. The recursive pro- 
duction is attained through the selection of initial conditions of the intra-cellular 
dynamics of each cell, so that it is rather robust against the change of interaction 
terms as well. 

In a real organism and in our model, there is a clear temporal flow, as for the 
loss of multipotency. Initial ES cells have totipotency to create all other types of 
cells, then stem cells have limited multipotency. The cells at later stage lose their 
multipotency. In our model simulation, this process is characterized by the change 
of the following quantities [6, 15]: 

(I) Increase of the stability of intra-cellular state 

(II) Decrease in the diversity of chemicals in a cell 

(III) Loss of chaotic instability in the intra-cellular dynamics 

The degree of (I) could be determined by a minimum change in the interaction 
to switch a cell state, by properly extending the “attractor strength” introduced 
in [11]. The degree of determination is roughly measured as the minimum per- 
turbation strength required for a switch to a different state. The diversity (II) is 
computed, for example, by the Shanonn entropy of the chemical concentrations 
over all chemical species, i.e. p{j)logp{j) with p{i) = x{i)/ x{j). Initial 

totipotent cells have high diversity, which decreases in the course of differentiation. 
For details and for (III), see [6]. 

As a related constructive experiment, note controlled tissue generation by 
Asashima’s group [1]. After taking undifferentiated cells from the animal cap of 
Xenopus, they put them into the solution of a protein “activin” for a while, and 
then cultured the cells, to examine how they develop later. By changing only the 
concentration of activin or few other chemicals, they succeeded in forming almost 
all tissues of the frog. The following two points should be noted in relationship 
with theoretical results, (i) jump-over generation of tissues, jumping over normal 
temporal course of development, (ii) community effect the formation of tissue 
is highly dependent on the number of cells. These two features show that naive 
picture for threshold mechanism for development is not sufficient. The feature (i) 
suggests the existence of attractors or attracting states as a dynamical system at a 
tissue level. The feature (ii) suggests that the differentiation is determined through 
interplay between intracellular dynamics and a cell ensemble state, as adopted in 
our study. 



5. Pattern Formation and Origin of Positional Information 

Question: Biological pattern formation (morphogenesis) is understood as a change 
of genetic expression depending spatially on the gradient of chemicals. To encom- 
pass the macroscopic pattern formation, it is often discussed that the positional 
information is formed according to the gradient of some chemical concentration 
which influences the concentration of such signal molecules that changes genetic 
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expressions. However, how is such positional information generated? To form gra- 
dients, is spatial differentiation of cells necessary. Therein lies a question, which is 
the first, positional information or differentiation of cells in space? 

Logic 

Including the spatial structure with diffusion of chemicals, we show that the 
differentiated types by the mechanism discussed so far are organized partially to 
form a pattern that is robust against perturbation. With this pattern formation, 
gradients of chemicals are formed that consolidate the differentiation of cell types. 
Accordingly positional information and dynamic differentiation mutually reinforce 
and stabilize each other. Dynamic differentiation is transferred to a spatial pattern. 

Indeed, by extending the models in §2-4, to include spatially local interaction, 
the above logic for the emergence of positional information is obtained. See [3 5,7] 
for details. 



6. Origin of Bioinformation 

Question: Among many chemicals in a cell only some chemicals (e.g., DNA) are re- 
garded to carry genetic information. Why do only some specific molecules play the 
role to carry the genetic information? How have roles in molecules been separated 
into information carrier and metabolism? Is the separation a necessary course of 
a replicating cell with internal biochemical reaction dynamics? 

Logic 

As a step in an investigation of the origin of genetic information, we study 
how some species of molecules are preserved over cell generations and play an 
important role in controlling the growth of a cell. We consider a model consisting 
of protocells. Each protocell contains mutually catalytic molecule species. Through 
divisions of the protocells, the system reaches and remains in a state in which 
there are only a few highly catalytic molecules, preserved through selection of 
very rare fluctuations. In this state, the molecules with high catalytic activities, 
minor in population, are shown to control the behavior of the protocell. This 
minority molecule species acts as the carrier of heredity, due to the relatively 
discrete nature of its population, in comparison with the majority species which 
behaves statistically in accordance with the law of large numbers. The relevance 
of this minority controlled state to evolvability is discussed [20]. 

Experiment 

As an experiment corresponding to this problem, note the experiment by 
Matsuura et al. [22], who succeeded in constructing an in-vitro replication sys- 
tem. The system consists of enzymes for the synthesis of DNA, DNA, and RNA. 
Through mutual catalytic process the whole set of chemicals is replicated by itself. 
It is also shown that the system heis evolvability only if the number of DNA in the 
system is only few. Otherwise, the system gradually loses its activity to replicate 
itself. These experimental results are consistent with the minority control theory 
presented. 
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7. Origin of a Cell with Recursive Growth 

Question: A cell consists of several replicating molecules that mutually help the 
synthesis and keep some synchronization for replication. At least a membrane that 
partly separates a cell from the outside has to be synthesized, keeping some degree 
of synchronization with the replication of other internal chemicals. How is such re- 
cursive production maintained, while keeping diversity of chemicals? Furthermore 
this recursive production is not complete, and there appears a slow “mutational” 
change over generations, which leads to evolution. How are evolvability and recur- 
sive production compatible? 

Logic 

To answer the above question, we study a system with many chemicals which 
catalyze mutually. These mutual catalyzations form a network. The dynamics of 
chemical concentrations are often chaotic in nature, and for some conditions, they 
show chaotic itinerancy in which a quasi-recursive chemical state is maintained over 
generations, and then switches from it through unstable chemical states. Here, con- 
centrations of some chemicals are so low, and the number discreteness is important. 
Then the above chaotic itinerancy is highly influenced by whether the number of 
such minority chemicals is zero or not. Through the on-off switch of such minor- 
ity molecule numbers, the system shows successive switches over quasi- recursive 
states. This leads to the evolution of recursive states [14]. 



8. Discussion 

To sum up the studies on cell differentiation and development so far, supports are 
given to the following conjecture. 

Assume a cell with internal chemical reaction network whose degrees of free- 
dom is large enough. Cells interact with each other through the environment. 
Some chemicals are transported from the environment. Through this process the 
cell volume increases and the cell is divided. If the initial cell state is quite stable, 
no differentiation occurs, when the competition for chemical resources is so high 
that the cell division stops at some stage. On the other hand, for some reaction 
networks, cells differentiate and the cell number continues to increctse. Hierarchical 
rule for cell differentiation is generated. The initial cell types have large chemi- 
cal diversity with temporal change of chemical concentrations. As the number of 
cells increases and the differentiation progresses, irreversible loss of multipotency 
follows. The realized states of cell types are temporally fixed while the number 
distribution of such cell types are stable against perturbations, following the spon- 
taneous regulation of differentiation ratio. 

When spatial factor is involved, the above differentiation induced by dynam- 
ical instability is consolidated to spatial pattern. With this process gradients of 
chemical concentrations are organized that provide positional information. The 
pattern formation with differentiation and positional information stabilize each 
other. 
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For the above development, the role of genes is not explicitly implemented. 
Still, the gene expression is relevant to stabilize the dynamics-originated differ- 
entiation. In a system with complex reaction network, it is shown that some 
molecules in minority start to carry the information, with regards to control of 
other molecules, and preserved transfer to the next generation. Whether the num- 
ber of minority molecule is zero or non-zero is crucial to the behavior of a cell, 
which provides a basis for “programmable behavior” in a biological system. 

To sum up, our viewpoint can be summarized as “plastic dynamics first, con- 
solidation to information iater”. Throughout the present research, the importance 
of plastic dynamics in a biological system is emphasized, in contrast to a viewpoint 
of life as a tightly determined, highly programmed system. 
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1. Introduction 

Living cells are highly organized systems. The question has been posed [1], whether 
the principles of self-organization in small biological cells are the same as in 
macroscopic chemical reactions like, e.g. the Belusov-Zhabot insky reaction [2]. In 
this chemical reaction-diffusion system complex wave patterns and spirals can be 
formed with a macroscopic wave length. The length scale of wave phenomena 
in chemical reaction-diffusion systems depends only on the diffusion constant D 
of reacting species and the reaction rate k according to Ldin = V Dk [1,3,4]. If we 
Etssume for typical biochemical reactions D = 10“'’crn^s“* and k = 10'^s~\ then 
= 1/iin. Therefore, there must be other mechanisms than spatiotemporal pat- 
tern formation of the reaction-diffusion type to maintain the high degree of order 
necessary for the physiological operation of the cell at submicrometer levels [1,3,4]. 



2. Networks of Protein Machines 

In this section we want to formulate the conditions under which enzymes in the 
small volumes of living cells can operate in a special kinetic regime. If the en- 
zymes can interact by exchanging small molecules acting as substrate, product, or 
regulatory molecules, a network of globally coupled protein machines is formed. 

Let us assume that a cell or cellular compartment has a linear size L and 
that the diffusion constant of small molecules is D. Accordingly, the mixing time 
Lnix for such molecules, i.e. the time span after which a molecule released at some 
point can be found, with equal probability, anywhere inside the volume, can be 
estimated [5], on the order of magnitude, eis 
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Typical values are L = Ijim. and D = 10“^cm^s“^ for small molecules in aqueous 
solution; we get t^ix = Inis. 

Suppose that we have a small molecule diffusing freely inside a microvolume 
and an enzyme molecule which we assume to be stationary on this time scale. 
The enzyme has a target group of size R on its surface (i.e. the binding site) to 
which the small molecule can bind. The time needed for the small molecule released 
somewhere in the volume to find a given enzyme and to bind to its target is defined 
as the traffic time ^traffic- Using the theory of diffusion-controlled reactions, we 
find [3“5] 



^traffic — ( 2 ) 

For R = Inm, L = Ifim, and D = 10“^'^cm^s“^ we have ^traffic = Is. That means 
that a small molecule in a microscopic cellular volume can find a given enzyme 
molecule by Brownian diffusion and bind to it within one second. 

If we have not only one enzyme molecule but a whole population of N inde- 
pendently and spatially uniformly distributed enzyme molecules inside the volume, 
the average time necessary for a small molecule to find its first target on an enzyme 
molecule is given by 



We can define the ratio 





L'* 


(3) 


N 


“ NDR' 


^transit 


L 


(4) 


^niix 


~ ~NR' 



giving the number of times a messenger molecule will cross the entire volume before 
it finds a target. With this definition we can derive a critical number A^crit 



= |. (5) 

If the number N of enzymes within the volume is much smaller than A^crit , N 
A^crito messenger molecule will find its first target after it has crossed the 
whole reaction volume several times. Independent of its initial position, it reaches 
the target. If, however, N ^ A^crit, the diffusing messenger molecule will find 
its target before it has crossed the reaction volume. It will therefore bind to an 
enzyme molecule in the close neighbourhood of its starting point. In the first case, 
communication committed by diffusing messenger molecules is global, whereas, 
with the number of enzymes N exceeding the critical number A^crit, we have only 
local communication. For a bacterial cell (L = l/j,m) the critical number of enzymes 
is A^crit = 1000; for a cellular compartment {L = 0.1/im) we have A^crit, = 100. 

Enzymes are the molecular machines of a living cell. Some act as molecular 
motors, others transport small molecules or even macromolecules across mem- 
branes, and others catalyze chemical reactions. In any case, machines operate far 
from thermodynamic equilibrium and need energy to do work. This energy is sup- 
plied in discrete portions by the binding of molecules or the fission of energy-rich 
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chemical bonds, as for example, in the case of ATP. After having taken up en- 
ergy the protein is in a conformational state far from equilibrium. In the following 
relaxation phase the protein returns back to equilibrium. 

According to this concept of conformational relaxation introduced by Blu- 
menfeld [6] , the duration of a cycle of such a protein machine is mainly determined 
by the duration of the relaxation period. The overall duration time r of the entire 
cycle varies from less than a millisecond to several seconds for different protein 
machines. Comparing the characteristic diffusion times with the duration r of a 
single enzymic turnover cycle, we see that the time needed to complete a reac- 
tion cycle may be much longer than the diffusion times of the small messenger 
molecules: 

^ ^transit ^ ^inix* (b) 

If this condition holds true, a population of protein machines effectively represents 
a molecular network with rapid global communication. A messenger molecule re- 
leased by a given protein machine may find any of the other machines in the 
cell with equal probability independent of their locations, because ^transit ^ ^mix- 
Moreover, this would happen within a timespan much smaller than is needed for 
a single machine to complete its cycle due to r ^ ^transit- In the whole system 
of globally coupled molecular machines the states of individual members of the 
network are now rigidly correlated, and the whole network behaves like a single 
dynamic object. 

In the literature, theoretical models of coupled periodic [7, 8] and chaotic [9] 
oscillators have been discussed, especially with respect to various synchronization 
phenomena of the coupled oscillators. Such synchronization phenomena can also 
be found in biological systems, for example in networks of pacemaker cells in the 
heart [10], in populations of synchronously flashing fireflies [11] or populations 
of synchronously chirping crickets [12]. Populations of yeast cells can synchronize 
their metabolic activities [13,14]. Finally, synchronization phenomena are possible 
in networks of nerve cells and can play an important role in the brain [15, 16]. 

Below we are going to discuss, as an example of a molecular network, a 
population of allosteric enzymes in a microvolume. We shall see that, under certain 
circumstances, this system can undergo a transition to coherent action of the 
protein machines characterized by synchronization of turnover cycles of individual 
enzymes. 



3. Mutual Synchronization of Enzymic Reaction Cycles 

Applying Blumenfeld’s concept of conformational relaxation to the operation of 
an enzyme machine, its turnover cycle can be modeled (see Fig. 1) by a diffusive 
drift motion along a one-dimensional conformational coordinate (/>, described by 
the stochastic Langevin equation 

d4 

dt 



V + T]{t) 



(7) 
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Figure 1. Model of the enzyme as a protein machine. (For col- 
ored picture see color plate 6). 



with constant drift velocity v and white Gaussian noise r] to take into account 
intramolecular fluctuations. For small noise intensities cr, the mean turnover time 
(r) and the statistical dispersion of turnover times ^ can be estimated [18] at 
(r) ^ 1/v and ^ The enzyme is waiting in the ground state until 

a substrate molecule binds at rate ly, and the enzymic reaction cycle is started 
at (/) = 0, corresponding to the first conformational substate. The progress of the 
reaction is described by Eq. (7). At the phase point (/) = a product molecule is 
released. The mean time r\ of product release is given by t\ = (l)p/v. After that the 
enzyme returns through a relaxation phase to the ground state. We assume that 
the enzyme is allosterically regulated by its product molecules. Product molecules 
formed by the enzyme can either decay at rate 7 , or they can bind to a regulatory 
binding site at the enzyme molecule at rate fj, and influence the binding rate 
of the substrate molecules, so that a feed back loop is formed. The probability 
rate fi can be written as the product of the probability rate (5 of an individual 
regulatory molecule to bind to the empty binding site of the enzyme in the ground 
state and the number of the free product molecules m. Regulatory molecules can 
dissociate at rate k. Binding of regulatory product molecules can lead to activation 
or inhibition of the enzyme, changing the binding rate of substrate molecules from 
u = jy() to ly = iy{)/x^ with y < 1 for activation and y > 1 for inhibition. For 
both cases, the enzymes can act in the synchronous regime of molecular networks. 
This has been theoretically investigated in detail not only for allosteric reactions 
with product activation [17,18], product inhibition [19], and enzymes composed of 
several cooperative subunits [ 20 ] , but also for non-allosteric reactions with closed 
reaction loops [ 21 ]. 

We will now focus, as an example, on an enzymic reaction with allosteric 
product inhibition in a small microvolume. If the strength of allosteric regulation 
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Figure 2, Distribution over cycle phases (a) and time depen- 
dence of the number of product molecules (b) for the asynchronous 
reaction regime in a population of AT = 400 enzymes* The reac- 
tion parameters are x — 10^, u ^ 1, r = 1, /?r = 0.03, j/qt = 10, 
7T = 15, KT = 20, Ti = 0*55r, and a = 0 {From [19]). 




Figure 3. Distribution over cycle phases (a) and time depen- 
dence of the number of product molecules (b) for the synchronous 
reaction regime in a population of = 400 enzymes. The reac- 
tion parameters are the same as in Fig. 2a, but with binding rate 
(3 of regulatory molecules /3r = 0.1 (From [19]). 

(i.e. the binding rate [3 of regulatory molecules) is low, the classical regime of 
enzyme kinetics is realized, characterized by incoherent action of individual enzyme 
molecules. This can be seen in Fig. 2a, where, in the histogram of the distribution 
over cycle phases 0 at a given time, the phases are uniformly distributed. The 
number m of free substrate molecules in the system fluctuates with time around 
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a certain mean value (see Fig. 2b). Increasing the strength of allosteric regulation 
above a critical value results in the coherent action of the whole enzyme population. 
The distribution over cycle phases (f) is now strongly non-Poissonian, showing a 
maximum that indicates correlations between the turnover cycles of enzymes. Now 
the enzyme molecules act coherently in one group (see Fig. 3a). This synchronous 
behaviour is reflected in the sharp spiking in the number m of product molecules 
with a period determined by the duration of a single turnover cycle (Fig. 3b). 

The investigations have been extended to enzymes composed of several sub- 
units [20]. Depending on how many regulatory molecules are bound, the subunits 
can interact which each other, their interacting leading to the phenomenon of coop- 
erativity. Assuming the sequential mechanism of cooperativity [22] for an enzyme 
composed of two identical subunits we found that both synchronization of turnover 
cycles of individual subunits within enzymes (intramolecular synchronization) and 
synchronization of turnover cycles within the whole population of enzymes (in- 
termolecular synchronization) can occur. Among the models of cooperativity we 
have studied, the most interesting is heterotropic cooperativity with activation of 
subunits by product molecules. If cooperativity between subunits is low, then in- 
tramolecular and intermolecular synchronization coincide. Increasing the strength 
of cooperativity causes first disappearing of all synchronization. When cooperativ- 
ity is further increased, synchronization comes up again. But now intramolecular 
synchronization is much stronger, whereas intermolecular synchronization is nearly 
absent. 



4. Conclusions 

We have shown in our theoretical investigations that enzymes can form a molecular 
network characterized by coherent action of all the enzyme molecules present in 
a microvolume typical of living cells. This synchronization of molecular turnover 
cycles can occur under a variety of modifications of our model, including allosteric 
activation and inhibition of the enzyme. 

Spontaneous coherent activity of enzymes in microvolumes in the regime of 
a molecular network has not been observed yet experimentally. However, there 
are experiments showing the synchronization of enzymic reaction cycles by exter- 
nal forcing. Populations of cytochrome P-450 monooxygenase enzymes could be 
synchronized by the application of periodic light pulses with a duration slightly 
shorter than the turnover time (1.54 s) of the enzyme [23 25]. This synchroniza- 
tion persisted for some time when the optical forcing was switched off, until the 
intramolecular fluctuations eventually destroyed the coherent molecular dynamics. 
The statistical dispersion of turnover times has been determined experimentally 
at about 20 % [26]. 
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1. Introduction 

Many aspects of life are based on the dynamics of complex molecular networks. 
Examples are the networks of metabolic reactions, of interactions between proteins 
and gene regulatory networks. Today, for several organisms such as E. Coli or S. 
Cerevisiae large databases are available, such that questions about the global 
structure of these networks can be addressed by means of statistical methods. For 
protein networks, a hierarchical, scale-free architecture has been found [3], with 
a small average number of interaction partners per protein (between 1 and 2). 
Thieffry et al. [6] analyzed the gene regulatory network of E. Coli at transcription 
level. The observed statistical distribution of the number of regulatory outputs, i.e. 
of genes controlled by a given regulatory gene, is shown in Fig. 1, the inset shows 
the distribution of the number of regulatory inputs a given gene receives. For 
both distributions the best fit to the data is an exponential. Interpreting genes 
as nodes and regulatory interactions as edges (links) of a directed graph, these 
distributions of in- and out-links deviate clearly from the Poissonian expected 
for a random graph with equal probability for each node to receive a new link. 
Another interesting feature is the low connectivity (average number of interactions 
per gene) of the network, which is between 2 and 3. 

In this paper we address the question of how the observed network topology, 
here for the example of E. Coli, might be related to general dynamical properties 
of gene regulatory networks. We propose a simple evolutionary algorithm in the 
framework of discrete dynamical networks which mimics functional change and 
selection at the single-gene level by a local coupling between an order parameter 
(dynamical activity of a network node) and a control parameter (connectivity) [1]. 
As a result we observe the self-organization of a network topology with an average 
connectivity between 2 and 3, and with a non-random degree-distribution. The 
network dynamics is stabilized in a non-chaotic regime. 
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Figure 1. Statistical distribution of regulatory outputs, i.e. the 
number K of genes controlled by a given regulatory gene {E. Coli^ 
transcription level), log-log-plot. The inset shows the distribution 
of regulatory inputs. Data (squares) were taken from [6], fits pro- 
portional to exp [-K] were added by the authors. 



2. Evolution of Network Topologies Based on Local Dynamical 
Rules 

2.1. The Modeling Framework: Discrete Dynamical Networks 

Discrete dynamical networks are a theoretical idealization of the dynamics found 
in gene regulatory networks: the networks nodes (genes) can take only two states, 
“on” or “off” , depending on regulatory interactions with other genes (at the level 
of transcription) modeled by weighted connections, which are either activating or 
inhibitory. Kauffman [4] introduced Random Boolean Networks, where the state 
of a gene is a logical function of its regulatory inputs, whereas more recent studies 
[7, 2] favor networks, where the state of a gene is determined by a weighted sum 
of its inputs plus a threshold (Random Threshold Networks, RTN). In our study 
we use a threshold network defined in the following paragraph. 

2.2. Definition of the Threshold Network 

Let us consider a network of N randomly interconnected binary elements with 
states a I = ±1. For each site i, its state at time t -f 1 is a function of the inputs it 
receives from other elements at time t: 

a,{t + 1) =sgn (Mt)) (1) 

with 

N 

Mt) = 'Ecij(^j{t) + h. ( 2 ) 

The interaction weights c^j take discrete values Cij = ±1, with Cij = 0 if site i does 
not receive any input from element j. In the following, the threshold parameter h is 
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set to zero. The dynamics of the network states is generated by iterating this rule 
starting from a random initial condition, using parallel updates of the N network 
sites. 



2.3. Network Dynamics 

RTN show complex dynamics, similar to Random Boolean Networks [4]. The num- 
ber of possible system states a = (cti, <J 2 , •••, is 2^. Hence, applying parallel 
updates as discussed here, the system dynamics inevitably settles down to a pe- 
riodic attractor of P repeating states (1 < P < 2^) after a number T of tran- 
sient states. The scaling properties of T and P as a function of system size N 
are highly non-trivial. In particular, a phase transition is observed at a critical 
average connectivity Kc ^ 2 [5] with lengths of transients and attractors diverg- 
ing exponentially with system size for an average connectivity larger than Kc- In 
this dynamical regime, small perturbations spread rapidly through the network, 
whereas near Kc network dynamics is considerably robust. Furthermore, at Kc the 
range of possible dynamic behavior is broad, reflected by scale- free distributions 
of attractor periods and transient lengths with a median scaling proportional to 
y/N. Kauffman postulated that gene regulatory networks may exhibit properties 
of dynamical networks near criticality [4]. This would confine network evolution to 
considerable small average connectivities close to Si hypothesis now becoming 
testable by modern experimental techniques (e.g. microarrays). 



2.4. An Algorithm for Evolving Network Topology 

As a dynamical order parameter we introduce the average activity A(i) of a site i 



A{i) 



1 

T2-T1 



2 



t='i\ 



(3) 



taken between two arbitrary points of time T\ and T 2 in the course of network 
dynamics. Roughly speaking, A{i) characterizes how often (on average) a gene 
changes between the state of being expressed (active) and being inactive. A value 
of A{i) close to one indicates an inflexible expression pattern almost constant in 
time, whereas A{i) 0 means that gene i is turned on and off very often in a rather 
chaotic manner. The first case indicates that the gene is regulated insufficiently, due 
to missing or bad integration into the regulatory network, whereas the latter case 
could result from conflicting signals of (too many) regulatory inputs. Both extremes 
are often hazardous for biological organisms, thus selective pressure would tend to 
favor mutations which lead to additional regulatory inputs in the first case, and vice 
versa in the latter case. These simple assumptions are integrated in the following 
algorithm evolving the topology of our model network (a simplified version of the 
algorithm introduced in [1]): 

1. Choose a random network with an average connectivity Ki^i- 

2. Choose a random initial state vector = 0) = (cri,cr 2 , •••^ctn) for the N 
binary network sites. 
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Figure 2. The selective criterion leading to topological self- 
organization: a gene which is regulated insufficiently (|^(i)| = 1) 
receives an additional regulatory input, a gene which tends to 
show chaotic expression patterns {\A{i)\ < 1), due to conflicting 
regulatory signals, looses one of its inputs. 

3. Calculate the new system states a{t) according to eqn. (2), using parallel 
update of the N sites. 

4. After Tu updates the network topology is changed according to the fol- 
lowing local rewiring rules: 

5. A site i is chosen at random and its average activity A{i) over the last T 
updates is determined (T <Ty). 

6. If |A(i)l — i receives a new link cij from a site j selected at random, 
choosing Cij = -f 1 or — 1 with equal probability. If \A{i)\ < 1, one of the 
existing non-zero links of site i is set to zero. 

7. Go to step number 3. and iterate. 

Selective pressure for moderate gene activity between the extremes of either 
static or chaotic dynamics enters into the model via local rewiring (step 6.): genes 
with very static expression patterns (|A(i)| = 1) receive a new (random) input 
- this adjusts their dynamics to the overall expression pattern in the network, 
whereas genes with fluctuating dynamics (|A(z)| < 1) reduce their connectivity, 
which favors simplicity of input patterns. 

In step 5. and 6. the average correlation Corr(i, jf) between two sites i and j, 
defined as the average over the products (Ti{t)aj{t) in dynamical network evolution 
between two distinct points of time T\ and T 2 

1 

Corr(z,j) = — ^ (4) 

can be applied instead of A{i) (results not shown here). If |Corr(z,j)| = 1, 2 
receives a new link cij from site j, choosing Cij = -hi or —1 with equal probability, 
if |Corr(z, j)| < 1, one of the existing non-zero links of site i or of site j is set to 



zero. 
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2.5. Results 

Independent from the initial conditions (initial state vector and connectivity) we 
observe a robust evolution of a network topology with an average connectivity K 
between 2 and 3; this is shown in Fig. 3 for a network of size N = 2048. One 
finds small, limited fiuctuations around the mean value due to the evolutionary 
dynamics. For large networks K approaches the critical value, where a percolation 
transition from ordered to chaotic dynamics is found (for this cla^s of networks 
slightly below K = 2). 




Figure 3. Independent from the initial conditions the proposed 
model leads to a robust evolution of a network topology with 
average connectivity Krv{t) between 2 and 3, shown here for two 
networks of size N = 2048 with different initial connectivities 
The mean value over the evolutionary process is K = 2.49. 

For the averaged statistical distribution p{K) of in- and out-links (Fig. 4) of 
the evolved networks one observes a fiat exponential decay 

p{K) ^Po-exp[-aK], (5) 

with po and a constant, similar to the findings for the transcriptional network of 
E. ColL This distribution clearly deviates from the Poissonian 

Prandom exp [ (6) 

expected for a random network with the same average connectivity K. 

Since the central step of our topology-evolving algorithm - step 6. - involves 
a local coupling between a dynamical order parameter (average activity) and a 
topological control parameter, one would expect a self-organization of global dy- 
namics showing deviations from random networks, corresponding to the evolved 
non-random structure. Indeed one observes self-organization processes stabiliz- 
ing the dynamics of the evolving networks in a regime of moderate, non-chaotic 
dynamics: the ratio between inactive or “frozen” genes and active genes, which 
occasionally change their expression state, is at an intermediate level, contrary 
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Figure 4. Statistical distribution p{K) of the number of regula- 
tory interactions K per gene in the proposed model for a network 
of size N = 64. Compared to the Poisson distribution for random 
networks with K = 4.46, it shows a flatter decay proportional to 
exp[— R"], similar to observations for the transcriptional network 
of E. Coli 



0 

0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 

average connectivity 

Figure 5. The fraction of “frozen” genes as a function of the av- 
erage connectivity for an evolved network of size = 64 (crosses). 

The dashed line shows the corresponding curve for random net- 
works. 

to the steep decay of the fraction of frozen genes found for randomly generated 
networks with increasing K (Fig. 5). 



3. Discussion 

Discrete dynamical networks have been discussed in theoretical biology as mod- 
els of general dynamical properties of genetic regulatory networks since about 30 
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years. In the framework of such models we studied possible mechanisms of struc- 
tural self-organization of genetic regulatory networks as a consequence of functional 
change at the single-gene level and selection. We started from the assumption, that 
for a gene with an inflexible, “frozen” expression pattern biological selection might 
favor mutations leading to new regulatory inputs (new functions), whereas genes 
with high number of (conflicting) regulatory inputs tend to push towards chaotic 
patterns of activity, thus mutations reducing the number of inputs would be fa- 
vored by selection. Our model, based on a local coupling between a dynamical 
order parameter (activity or correlation of genes) and rewiring of regulatory inter- 
actions, predicts an average connectivity and non-random network structure well 
compatible with current experimental data, as for example, for the transcriptional 
network of E. Coli. Hence, one might speculate that similar mechanisms involving 
an interplay between network dynamics, mutations and selection may play a role 
in the evolution of regulatory networks of simple organisms. 
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1. Introduction 

In systems of dynamical networks, a fundamental role is played by the pattern 
of connections which determine the interactions among the individual elements of 
the network. Such patterns of connections are specified by a graph, whose nodes 
and edges describe the respective units in the network and their mutual connec- 
tions; graph theory [1-4] therefore constitute a key role in the characterization 
and investigation of the properties of network structures. The examples of such 
structures are wide spanned, and in recent years, the organizational structure and 
characteristics of physical and social networks such as the worldwideweb [5,6], 
the Internet [7], collaboration networks [8] and small world societies [9], as well 
as linguistic [10], neural [9,11] and protein folding networks [12], have received 
substantial scientific attention. In addition, properties of plastic networks with 
evolving patterns of connections have been considered in [13,14]. A detailed re- 
view of many of these studies was given in [15]. 

Much effort is invested in understanding how the structure of a network is 
mapped to its functional and operational properties. On the other hand, in ap- 
plications ranging from bioengineering to neurosciences one also needs to design 
networks with a given function or reconstruct a network from its dynamics. Tak- 
ing into account the great complexity of network dynamics, explicit solutions of 
such inverse problems of graph theory are difficult. But graph reconstruction may 
also be achieved without any knowledge of rules, by running an artificial evolution 
process through which a network learns to generate certain dynamics by adjusting 
its internal organization. Indeed, evolutionary algorithms are known to yield effi- 
cient solutions for complex optimization problems [16]. For the problem of graph 
reconstruction, such an approach has previously been proposed [17]. 

In this paper, we shall discuss the following inverse problem; suppose that 
only the spectrum of some unknown reference graph is given, can the graph itself 
be reconstructed by using this information only? Finding an exact solution by 
subsequently testing all possible graphs is in practice impossible since the num- 
ber of graphs of even moderate sizes is astronomical. In view of this difficulty, 
we describe a simple stochastic evolution scheme which consists of a consecutive 
sequence of mutations and selections applied to nodes and vertices of a randomly 
chosen test graph. This method is similar to the Metropolis algorithm [18] and was 
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initially discussed in [19]. Specifically, we focus on reference networks correspond- 
ing to relatively large random graphs. For this case, we find that the stochastic 
reconstruction process yields rapid convergence to the spectral properties of the 
reference graph for randomly chosen initial conditions. In addition, other impor- 
tant properties of the random reference graph, such as the clustering coefficient, 
the diameter, and the mean degree, are also well reproduced. 



2. Graph Theory 



The connections or vertices between the N nodes of a graph G are described by 
its adjacency matrix A, where A{j = 1 if the z’th and j’th node are connected, 
and Aij = 0 otherwise (note that diagonal elements satisfies An = 0) [1]. The 
Laplacian spectrum [4] of the graph G is then given by the set of eigenvalues of 
the matrix T, whose elements are given by 

N 

Tij = ^ ^ Ajj — rriiSij ( 1 ) 



where rrii = degree of the z’th node and Sij denotes the usual 

Kronecker symbol. 

Laplacian spectra are closely related to dynamical properties of a simple net- 
work: Consider a hypothetical linear “molecule” consisting of N identical particles 
connected by identical elastic strings. The pattern of connections is defined by a 
graph G: a bond between particles z and j in the network is present if the respective 
element A^j in the adjacency matrix of the graph G is equal to unity and absent 
otherwise. This dynamical system is described by a set of differential equations 
Xi -h ~ ^j) = 0 for coordinates Xi of all particles. Obviously, the 

vibration frequencies uJk of such a molecule (/c = 0, 1 , . . . , A' — 1) are given by the 
eigenvalues of the matrix T. Note that one eigenvalue Ao always satisfy 

A() = 0 due to the translational invariance of this equation. For this reason, the 
Laplacian spectra of a graph are also known as the vibrational spectra [4] . Besides 
yielding a link to dynamical networks, spectra provide a powerful invariant char- 
acterization of graphs, since each graph of size N is mapped into a set of A' — 1 
positive real numbers cui. 

Instead of working directly with the discrete spectrum of a graph, it is con- 
venient to introduce the following spectral density p{uj) given as a sum of Lorentz 
distributions 



N-l 
k=l ^ 



7 

+ 72 



( 2 ) 



where 7 is a common distribution width and AT is a normalization constant chosen 
such that p(cj)dcj = 1. It is also convenient to introduce the spectral distance 
e as a measure of the difference between the spectral densities po(c^) and p(cj) of 
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Figure 1. Map showing the problem of the Bridges of 
Konigsberg [20] which gave birth to the foundation of graph the- 
ory as a mathematical discipline (a). The same problem repre- 
sented in terms of a graph G and its associated spectral density 
for 7 == 0.08 (b). (For colored picture see color plate 6). 



two graphs Go and G 

I ~ 

[po(w) - p{u>)]'^duj. (3) 

The graph and the corresponding spectral density associated with celebrated prob- 
lem of the bridges of Kningsberg due to Euler [20] is shown in Fig. 1. 

The spectral distance (3) provides one measure of the difference between 
two graphs in terms of their spectra. Other characteristic properties of a graph 
which also will be discussed here are the clustering coefficient G, the diameter L, 
and the mean degree D, which are defined as follows: The individual clustering 
coefficients Gy describe the ratio between the actual number of connections E,, 
between the ttu nodes connected to the z’th node and the total number m,{l — 
rrii)/2 of possible connections among them, namely Gy = The (mean) 

clustering coefficient G, which expresses the degree of clustering of a graph, is 
then the average (1/A^) Gy. The diameter of a graph is the maximum of all 

distances d, j in G, where the distance d^J is the length of the shortest path between 
the i’th and j’th node. The diameter therefore corresponds to the greatest distance 
between any two nodes in a graph. The mean degree D is simply the average over 
the N individual node degrees my, i.e. D = (l/N') Xlfli 




3. Evolutionary Algorithm 

Consider now the problem of reconstructing a graph Go from its spectral density 
po(cj) only. The solution of this inverse problem is not unique due to the existence 
of cospectral graphs (topologically different graphs with the same spectrum). How- 
ever, the fraction of such cospectral graphs is very small: for N < 6, there is only 
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one pair of connected cospectral graphs. For regular graphs, one pair of size 12, two 
pairs of size 16, a cospectral quadruple of size 28, seven cospectral pairs of size 25, 
and a set of 91 cospectral graphs of size 35 have been found [3]. Consequently, two 
graphs with identical spectra, will indeed also with a high probability be identical. 
Furthermore, if two graphs have only slightly different spectra, the two graphs are 
typically also similar. 

Finding an exact solution by subsequently testing all possible graphs is in 
practice impossible since the number of graphs of even moderate sizes is astro- 
nomical. Instead we propose a stochastic process where an arbitrarily chosen ini- 
tial test graph undergoes an iterative sequence of mutations and selections. The 
algorithmic structure of the stochastic evolution is defined as follows 

• Initialization. First a random test graph G is generated, and its spectral 
distance e to that of the spectral density po(^) of the reference graph is 
calculated. 

• Mutation. One random node i of the graph G is chosen and all its connec- 
tions are deleted. The new degree nrii of the z’th node is chosen at random 
in [1, TV — 1] followed by the generation of a set of new random connec- 
tions. The obtained graph is denoted by G' and its spectral distance e' to 
the reference spectrum po{uj) is calculated. 

• Selection. The initial graph G is then replaced by the mutated graph G' if 
(i) Ac = e — e' > 0 or (ii) exp{ — Ae/e0) < p, where p is a random number 
in [0, 1]. 

The last two steps of the above stochastic scheme are then applied iteratively 
until a certain threshold of Ac has been reached. Note that a selection parameter 
6 (equivalent to a “temperature”) has been introduced in order to avoid that the 
stochastic evolution gets trapped in a local minimum; an unfavorable mutation 
leading to Ae < 0 may therefore still be accepted with a probability exp{—Ae/eO). 
The scheme is similar to the Metropolis algorithm [18] used in other complex 
combinatorial optimization problems. 

The application of this reconstruction procedure to several different types of 
networks, such as clustered and small- world networks, has been discussed in detail 
in [19]. In the next section, we discuss another application of the above stochastic 
scheme, namely the reconstruction of large random graphs. 



4. Reconstruction of Random Graphs 

As an example of the reconstruction process, we consider a random reference graph 
G() of size TV = 50 with a connection probability po = 0.8. As initial test graph a 
random graph Ginit with connection probability pinit =0.1 is chosen. The spectral 
densities of the two graphs Go and Ginit are shown in Fig. 2. To visualize the 
difference in the structure of Gq and Ginit, their associated adjacency matrices 
Ao and Ainit are shown as insets in Fig. 2. Here the elements in the matrices 
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Figure 2. The spectral densities of a random reference graph 
Gq (solid line) and the initial test graph Ginit (dashed line) for 
= 0.08. The structure of the two graphs are shown by the ad- 
jacency matrices Ao and Ainit in the two insets (each black dot 
corresponds to a pair of connected nodes in the respective graphs). 



are represented by a square array of pixels using either black or white colors for 
nonzero and zero elements respectively. 

The time dependence of the spectral density of the test graph during the 
stochastic evolution both with and without selection is illustrated in Fig. 3a b. 
Here the dotted lines indicate the point where the evolution scheme is initiated. 
In (a), no selection is performed and all mutations are accepted. As expected, this 
choice results in a completely irregular and purely random evolution. In (b), the 
selection step is introduced after each random mutation; after a very short transient 
(of the order 50 iterations), the spectral density of the test graph is already very 
close to that of the reference graph. During the following 3 x 10"^ iterations, a 
“refinement” process takes place, where only a few mutations are accepted; this 
can be seen in (b) by the thin line corresponding to a “false” maximum in the 
spectrum of the evolving test graph, which disappears near t — 3.5 x 10 ^ After 
5 X 10"^ iterations, the final spectral distance e is 0.032, and the spectral density of 
the final test graph is almost identical to that of the reference as shown in Fig. 3c. 
This is emphasized further in the inset of Fig. 3 where the absolute difference of 
the spectral densities is plotted. 

To test the statistical properties of the reconstruction process, we have per- 
formed 10'^ stochastic evolutions each starting from a random test graph with 
connection probability chosen randomly on [0, 1]. Each evolution is stopped after 
5 X 10"^ iterations. The result of this analysis is shown in Fig. 4, where the time de- 
pendence of the mean spectral distance l{t) is shown. The mean spectral distance 
reached by the stochastic evolution is 6 = 0.024. The slope near t = 5 x 10"^ = 0 is 
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Figure 3. The stochastic evolution of the spectral density of the 
initial test graph Ghut without (a) and with selection (b). The 
final spectral density reached by the stochastic evolution (c). The 
absolute difference between the spectral densities of the final net- 
work reached by the stochastic evolution in (b) and the reference 
network is shown as an inset in (c). Parameters are 7 = 0.08 and 
6 = 0 . 002 . 



still slightly negative, indicating that additional improvement of the reconstruction 
can be obtained for a longer series of iterations. 

As illustrated in Fig. 4, other important characteristics of the reference graph, 
such as its diameter L, clustering coefficient C, and its mean degree D are also well- 
reproduced as seen from the time dependence of the mean ratios L{t)/Lo, C{t)/Co, 
and D{t)/D()\ the final ratio of these numbers are close to unity, indicating that 
these characteristic properties of the reference network also are well-captured by 
the approximate reconstruction process. 

As another example of a graph reconstruction process, we have chosen a 
relatively sparse random graph G with connection probability po = 0.1. The asso- 
ciated spectral density and the structure of the adjacency matrix Aq is shown in 
Fig. 5 a. Similar to the previous example, we have performed an average over 10'^ 
stochastic evolutions with randomly chosen initial test graphs and terminate each 
evolution after 5 x 10"^ iterations. The results are presented in Fig. 5b where the 
time dependence of the mean spectral density e{t) and the averaged ratios L(t)/Lo, 
G{t)/Go, and D{t)/Do are shown. Again, the performance of the reconstruction is 
very good, the final mean spectral density is 6 = 5.0 x 10“^, and the final ratios 
of the diameter, clustering coefficient, and the mean degree are all close to unity. 

Similar to the case of the dense random graph discussed above, the last 
stage of the time evolution of the spectral density also indicates that additional 
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Figure 4. Time dependences of the mean spectral distance e{t) 
and of the ratios L{t)/Lo, C{t)/Co, and D{t)/Do averaged over 
10'^ evolutions. Parameters are 7 = 0.08 and 6 = 0.002; for the 
reference graph Lq = 2, Co — 0.802, and Dq = 39.4. 



improvement of the reconstruction can be obtained for a larger number of iterations 
in the evolution. 



5. Discussion 

A simple stochastic scheme for reconstruction of graphs from their spectral prop- 
erties has been described. The approximate reconstruction of clustered and small- 
world networks as well as the exact reconstruction of smaller random graphs 
{N = 10) was considered in [19]. In this paper, two other examples, using both 
large dense and sparse random graph as reference graphs, have been presented; in 
both cases, the reconstruction process produces a graph whose spectral density is 
very close to that of the considered reference graphs. Furthermore, characteristic 
numbers such as the diameter, clustering coefficient, and the mean degree of the 
reference network are also well-captured by the reconstruction process. 

For the two cases studied here, the last stage of the time evolution of the mean 
spectral density e indicates that additional improvement of the reconstruction can 
be obtained if the evolution is continued for a larger number of iterations. This 
seems to be a general feature for the graph structures studied both here and in [19]. 
The exact form and properties, however, of the mean spectral distance e during 
the final and initial stages of the evolution is still not well- understood. 

In all the examples discussed here, the sizes of the reference graphs and test 
graphs have been equal; certainly, it would also be of interest to investigate the 
properties of the reconstruction, especially for test graphs with a size significantly 
smaller than that of the reference system. 
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Figure 5. The spectral density of a sparse random reference 
graph G(). The structure of the graph is shown by the adjacency 
matrices Aq where each black dot represents a pair of connected 
nodes (a). Time dependences of the mean spectral distance e{t) 
and of the ratios L{t)/Lo, C(t)/Co, and D{t)/Do averaged over 
10’^ evolutions (b). Parameters are 7 = 0.08 and 0 = 0.002; for 
the reference graph Lo = 6, Co = 0.084, and Do = 4.4. 



The proposed stochastic algorithm can be considered ais a learning process 
during which a certain network adapts its structure to an external signal. The 
characteristic properties of the external signal has in this sense been stored or 
learned via the intrinsic network structure reached via the evolutionary process. 
Note, that we here only have considered spectra of graphs as the reference in the 
reconstruction process. In principal, the origin of the considered reference signal 
is irrelevant. Certainly, reconstruction and design of networks from power spectra 
of an audio or visual origin could be of importance. 
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1. Introduction 

At the cellular level, networks of coupled biochemical reactions underlie all func- 
tions - from cell cycle to development. These reactions control and co-ordinate 
the cellular functions through regulated activities of the genetic and metabolic 
reactions in a highly repeatable manner, in the face of uncertain external and 
noisy internal environment [1]. The co-ordinated behaviour exhibited by the cell 
in response to a variety of signals and perturbations is achieved through various 
regulatory interactions involving both short and long range feedback loops of the 
positive and negative type. The cell is quite flexible in its use of types of reg- 
ulation and uses various combinations of feedback processes for such activities. 
Nested feedback loops are frequently seen in amino acid biosynthetic pathways, 
where a single common precursor can lead to the synthesis of more than one amino 
acid [2]. Living systems seem to have taken advantage of the variability in func- 
tional dynamics emerging from a few common components by simply connecting 
them in different ways. Given the predominance of negative feedback processes and 
reaction pathways with low connectivity observed in both metabolic and transcrip- 
tional regulatory pathways in E.coli. [3,4], we have studied the canonical three-step 
single negative feedback system with end product inhibition and compared it to a 
structurally more complex system in terms of an additional negative feedback. The 
results of this study of the different topologies of feedback processes, in isolation 
as well as when they are interacting with each other, show the type of dynamics 
attainable by these designs under various genetic and biochemical alterations of 
the pathway. We also evaluate the response of the pathway to perturbation in the 
end product concentration in order to judge the robustness of the dynamics in that 
particular topology. Our approach could be a possible way of studying regulation 
in cellular processes and arriving at a set of minimal designs adopted by nature. 

2. The Model Pathways 

The basic three-step reaction module, shown in Fig. 1, involves the conversion 
of a substrate A into a product C via an intermediate, B, where C inhibits the 
formation of A through a co-operative process by combining with another molecule 
R (Model 1). This type of process is common in gene regulation and is known as 
repressor-mediated repression [5-7] . In Model 2, we consider an additional feedback 
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in the pathway (shown by the dotted line in Fig. 1 ) where the end product, C 




Figure 1. The model three-step negatively-regulated reaction path- 
way. The arrows from A, B and C represent degradation reactions. 



inhibits its own synthesis at the B to C conversion step through the enzyme E. 
Thus, here the three-component reaction module is regulated by two negative 
feedback processes. The utilisation of the end product is considered to follow a 
hyperbolic function in this model. To study the temporal behaviour of these model 
pathways, the coupled differential equations for the processes can be written as: 



dx 

dt 

dt 

dz 

dt 



Model 1: 

= F(z)-a,x+^ 

= X - a‘2y 

= y - a^z - g 



Model 2: 

dx ^ 1 

_ = F(,)_a,x+ — 

dt 
dz 
dt 



- ot‘2y 

y 



l+pz 



- OiAZ - g 



k -\- z 



1 -h (1 -f J ’ 

Here, x, y and 2 represent the dimensionless concentrations of A, B and C re- 
spectively. The parameters are e : 7 for the strength of end product inhibition 
of A by the [CR] complex, assuming the [CR] interaction to be of the Michaelis- 
Menten type with n as the cooperativity of repression; 01 , 02 , and 0.1 for the 
rates of degradation of A, B and C respectively; and, g for the rate of utilisa- 
tion of the endproduct of the pathway in cellular processes which is taken as a 
constant in Model 1 [5]. The additional parameters in Model 2 are e : (3 for the 
rate constant of the enzyme-endproduct inhibition reaction with cooperativity 2 , 
as is observed for the tryptophan biosynthetic pathway [8 10 ], and k for the rate 
constant of the reaction for utilisation of end product in the pathway. For /c = 0, 
the rate of utilisation of 2: is constant (g) as is assumed in Model 1. Numerical 
simulations and bifurcation analysis of the model pathways were carried out us- 
ing XPPAUT and MATHEMATIC A [11]. The basal parameter values, taken from 
the normal (wild-type) tryptophan biosynthetic pathway in E.coli [6,12,13], are: 



F{z) 



1+7 
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= 10, ^ = 4, n = 2, ai = 1, 0^2 = 0.01, = 0.01, (5 = 0.33, and k = 0.0173. 

Below we compare the dynamical behaviour exhibited by the two model pathways 
under basal and altered parameters, and also study the robustness of the two 
models in response to perturbation in end product concentration. 

3. Results 

Both the model pathways show equilibrium dynamics for basal parameter values, 
though they stabilise at different steady states. The Model 2 pathway, being more 
tightly controlled with two coupled negative feedback processes, yields about 200 
times less of end product 2: when compared to Model 1. Thus, both the wild type 
model pathways are stable. 




Figure 2. The effect of increasing 7 on the model pathways. 
Model 1 - Phase Plots showing (a) Stable dynamics for 7 = 
100; (b) Bistable behaviour for 7 = 200. Model 2 - Time series 
showing (c) equilibrium dynamics for 7 = 10, 100, and 1000 are 
shown by the dashed, continuous and dotted line respectively; (d) 
Stable limit cycle for 7 = 3500, /c = 0. 



3.1. Conditions for Achieving Oscillations in the Dynamics of the Pathways 

Model 1: To identify the minimal conditions under which the pathway may desta- 
bilise to induce oscillations, we studied its behaviour on varying 7. Our study 
showed that, at the basal parameter values, as 7 is increased, the steady state con- 
centration of the end product reduces drastically until it loses its stability through 
a sub-critical Hopf bifurcation and the pathway shows bistable behaviour, i.e., co- 
existence of a stable limit cycle and equilibrium dynamics between 7 = 32 and 
37. This region of bistability encompasses a larger range and shifts towards higher 
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values of 7 as is decreased. This indicates an increase in the stability for path- 
ways where the end product is under-utilised. Fig. 2a shows the {y — z) phase plot 
for 7 = 100 and g = \ when the pathway evolves to a stable fixed point from any 
region of the phase space. In the bistable range, at 7 = 200 (Fig. 2b), the pathway 
returns to the stable steady state when perturbed at a small region around the 
fixed point, but spirals out to a limit cycle beyond this region. The dotted curve 
in Fig. 2b is the separatrix that separates the phase space around the steady state 
that leads to these two dynamical behaviours. The region of attraction decreases 
in size with increasing 7 , ultimately collapsing to the steady state, which then 
becomes unstable at the bifurcation point and shows stable periodic dynamics. 
Model 2: The Model 2 pathway exhibited equilibrium dynamics under increasing 
strength of repression (Fig. 2c) with reduced end product concentration. Thus, 
changes in more than one parameter was needed to destabilise this system. Fig. 2d 
shows that the pathway exhibits small amplitude limit cycle oscillations for 7 = 
3500 when k was also set to 0. This condition (i.e., k = 0) allows a direct compar- 
ison of the results with Model 1, and one can see clearly that the addition of the 
second negative feedback increases the stability of the system, since higher 7 is 
required for the steady state to be unstable. The phenomenon of bistability, which 
is seen in case of Model 1, is not observed in this case. 



3.2. Role of Different Parameters in the Stability of the System 

In our analyses of the two models, we see that increasing 7 and g have the potential 
for destabilising the pathways. Here we describe the effect of altering the rates of 
degradation, a\^a 2 ^ 0 L[\ with respect to the rate of utilisation of end product, on 
the dynamics of the system. The two parameter plots shown in Fig. 3 are divided 




Figure 3 . Two parameter plots of the rate of degradation of y (02) 
and g showing stability regions, (a) Model 1: 7 = 10 and (b) Model 2: 

-y = 300. See text for details. 

into two regions - the white space where the steady state is stable leading to 
equilibrium dynamics, and the shaded regions for unstable steady state where the 
pathway shows either stable limit cycle dynamics through Hopf bifurcation (dark 
grey), or, is unstable evolving to a negative steady state (light grey). 

The two parameter studies on Model 1 showed that for other parameters 
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at basal values, changing a\ does not change the stability of the pathway. It 
is only when g is increased, that the pathway shows limit cycle dynamics for a 
range of high values of beyond which it is stable again. On the other hand, the 
pathway requires very high values of g for all values of 0^3 to show Hopf bifurcation, 
indicating that alteration in the degradation rate of 2 : does not aid in destabilising 
the pathway. Interestingly, as shown in Fig. 3a, the pathway can be destabilised 
at basal or lower values of g by increasing a 2 even at basal value of repression 
strength. This indicates that the single negative feedback pathway is sensitive to 
a small increase in the degradation rate of y. The Model 2 pathway, on the other 
hand, does not lose its stability for the basal strength of repression for changes 
in oi, o; 2 , and aa. The plot of Q !2 versus g shown in Fig. 3b is obtained for 
high strength of repression (7 = 300) and /c = 0. This indicates that dual negative 
feedback pathway is also sensitive to small increases in the degradation rates of y 
under some conditions. 

3.3. Robustness of Pathway Dynamics 




Time Tiint 

Figure 4. Effect of perturbation on z in Model 1 at the bistable 
parameter values - (a) small perturbation returns to equilibrium state; 

(b) large perturbation leads to stable limit cycle. 

Biochemical pathways in cells are subject to fluctuations both in the rates 
of various reactions and in the concentrations of the metabolites. To judge the 
robustness of the dynamics exhibited by the two model pathways, we evaluate 
their dynamic response under the effect of sudden perturbations in the end prod- 
uct concentration, z. When the Model 1 pathway is at equilibrium or oscillating, 
instantaneous perturbation of z returns the pathway to its original dynamics. How- 
ever, in the bistable region, the effect of the perturbation depends on the strength 
of the perturbation. For small perturbations, the system returns to its original 
equilibrium dynamics through damped oscillations (Fig. 4a). In case of larger per- 
turbations, the pathway dynamics is pushed from its original equilibrium dynam- 
ics to evolve to periodic dynamics and the pathway shows large oscillations in end 
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product concentration (Fig. 4b). Thus, this pathway can switch its dynamic be- 
haviour from one type (equilibrium) to a qualitatively different type (oscillatory) 
on perturbation or noise in molecular concentrations of the basic reactants of the 
pathway. The Model 2 pathway, when subjected to perturbation, returns to its 
original dynamics for a large range of perturbation strength. Thus, though this 
pathway has the same basic reaction module as in Model 1, the particular design 
of the negative feedbacks allows its dynamics to be highly robust against changes 
in concentrations of the reactants. 

4. Discussion 

From all the above analysis we can see that the pathway with two negative feed- 
back (Model 2) is much more stable than that with a single negative feedback 
(Model 1), both to changes in parameters as well as changes in concentrations. To 
look for the source of robustness in Model 2 pathway, we considered a pathway 
where there is no end product-mediated repression of A, but only auto- repression 
of C. The dynamics of this system appears to be extremely stable for both b 2 isal 
values and variations of the parameters, thereby indicating that this short-range 
auto inhibition process in the two-negative feedback pathway could be the source 
of higher stability. 

During the last few years, with the advent of “forward” engineering of gene 
regulatory circuits [14], simple genetic circuits have been constructed whose dy- 
namics have been compared with the predicted behaviour of the corresponding 
mathematical models. Becskei et al. [15] have constructed a simple genetic circuit 
consisting of an auto- regulated, single negative feedback system and have shown 
that the output of the system is more robust and stable than a similar system 
without any feedback inhibition. Other arrangements of negative feedback such 
as a genetic “toggle switch” [16] having a control circuit comprising of two mutu- 
ally opposing negative feedbacks, and another [17] with three negatively regulated 
genes opposing one another in a sequential manner (the “repressilator” ) have also 
been constructed. The former shows bistable behaviour, i.e. co-existence of two 
stable steady states, and the latter shows oscillations as expected from the corre- 
sponding mathematical models. These studies and our results imply that negative 
feedback alone does not determine the dynamics of the pathway, rather the design 
of the feedback connections determine the functional dynamics. With experimen- 
tal studies on construction of gene circuits in one hand, and detailed theoretical 
models of existing biochemical pathways based on newer genetic, biochemical and 
molecular sequencing data on the other, the future seems exciting for the under- 
standing of the internal organisation of the whole cell. 
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1. Introduction 

Transcription factors and their binding sites have become a focal point of bioinfor- 
matics research since it became clear that regulatory networks are a centerpiece of 
genetic information processing. Understanding how genetic information controls 
and organizes complex biological processes, such as metabolic dynamics, develop- 
ment and morphogenesis [13], is a major challenge which is addressed by current 
bioinformatic modelling. The process of realizing phenotypic properties encoded 
in DNA sequences necessarily involves sequence specific contacts between DNA 
subsequences and molecules with a biological activity. 

All proteins which recognize DNA in a sequence-specific manner do so by 
binding to short sequence words on the DNA double helix. The sites where bind- 
ing occurs are called binding sites, and the words which are recognized by a pro- 
tein are the binding words. Transcription factors are the most important type 
of DNA binding molecules. Consequently, transcription factors and their binding 
sites have received much scientific interest during the last several years. However, 
a theoretical and principled basis for understanding binding site sequences and 
their evolution has not yet been developed. 

The analysis of binding sites developed by Schneider [4, 5] is based on infor- 
mation theory, and thus it recommends itself as a point of departure for building 
such a basis. Schneider et al. observed an approximate equality of the information 
content of binding site sequences (Rsvxi) and the information content of binding 
site positions, calculated on the basis of binding site frequency within the genome 
(i^freq) [4]. However, this observation cannot be trivially attributed to principles 
of information theory; a theory to account for this finding is not available to date. 
With this contribution, we aim to extend the theoretical basis for studying tran- 
scription factors, their binding sites, and their coevolution within the context of 
regulatory gene networks. 

A transcription factor must bind to the sequences at the appropriate binding 
sites in the regulatory regions of its target genes, and at the same time, it must 
not bind to other sites. Thus, the pattern of binding sites on the genome is a key 
property that is evolutionarily selected for, while mutations that do not alter this 
pattern (i.e. which do not create new binding sites and do not disrupt existing 
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binding sites) can be assumed to be neutral, i.e. mutants with identical binding 
patterns are functionally equivalent and hence form a neutral set. 

We investigate the structure of such neutral sets using a maximum entropy 
approach. This provides a theoretical basis for understanding the empirical ob- 
servation that in several cases, the information content deduced from binding site 
sequence sets (i^seq) approximately equals the information content deduced from 
relative binding site abundance (i?freq)- 

2. Model Overview 

2.1. Genome Model 

Formally, a genome is modelled by a vector of letters: Let d = (di , c? 2 , . . . , d;v), -/V G 
N,d, G A denote a genome consisting of N letters drawn from alphabet A. Let 
a = \A\ denote the size of the alphabet. We will consider the case of a = 4 
(nucleotide sequences, A = {A, C, G, T}). Obviously, the total number of genomes 
of length A” is a^. 

Binding sites of transcription factors are short subsequences, or sequence 
words. Let L denote the length of such a word, i.e. the region on the sequence 
which makes contact with the protein and which is thus relevant for sequence 
specific binding. Let W denote the set of all words of length L and K = |W| = 
denote the number of words with length L, where Wi (d^, . . . , 

denotes the word starting at position i within genome d. 

The pattern of binding sites in the genome d is described by a binary vector 
of length N (we neglect the fact that in linear genomes, the last positions N-L-\- 
2 ... A cannot be binding sites): 6 = (^i, ^ 2 , • • • , ^v), ^ {0, 1}, such that bi = 1 

if a binding site begins at position i and bi = 0 otherwise. The number of binding 
sites (i.e. the number of ones in b) is denoted by n. 

2.2. Transcription Factor Model 

The properties of a transcription factor with respect to binding site recognition are 
described by simply listing all words which are recognized by the factor. While this 
approach certainly does not yield parsimonious descriptions of binding properties, 
it ensures that all possible binding behaviours can be represented. Thus, a tran- 
scription factor is modelled by a vector r = (ts^,T52, . . • ,T5^),Sj eyVjTs- G {0,1}, 
where = 1 if the factor binds to Sj and = 0 otherwise. Evidently, the number 

of such vectors r amounts to 2^ = 2^" \ 

The transcription factors described this way can be grouped into classes ac- 
cording to how many different words are recognized by a given factor r. The 
number of words recognized by a factor r will be denoted by k = 

2.3. Maximum Entropy Analysis 

A transcription factor identifies n binding sites out of all A sites in the genome thus 
providing positional information amounting to i^freq = log(A) - log(n). Binding 
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of a factor molecule at a site also indicates that the site sequence is one of the k 
binding words. Considering that there are K words, the amount of information 
about the sequence is jRscq = ^og{K) — log(/c) (see [5] and references therein for 
details). Thus, if i^seq = ^froq we expect 



k = kn 



Kn 

W' 



( 1 ) 



It is easy to see that substantial deviations from this equality are quite possi- 
ble. For example, a transcription factor may accept only one binding word, but 
evolution may nonetheless produce genomes with high binding site densities by 
producing genomes in which the binding word is overrepresented. One may con- 
jecture, however, that the equality applies in a statistical sense, i.e. that it is 
(approximately) valid for a dominant majority of cases. This hypothesis can be 
assessed by maximum entropy analysis. 

Formally, our maximum entropy ansatz is: Let ft{n,k) denote the number of 
(f, d) tuples which realize a binding pattern with n sites on d with a transcription 
factor r that recognizes k different words. Then, the equality i^scq = -Rfroq is 
to be expected if, for a fixed value of n, the maximum of Q{n,k) is reached at 
k = kn = The calculation of Q{n, k) is complicated by the overlap between the 
Wi in d G . For the subsequent analysis, we therefore treat genomes as sequences 
of non-overlapping, and hence independent words w = (ici , rc- 2 , . . . , rc/v) G (.4^^)^ 
for our subsequent analysis. With this approximation. 



n'{n,k) 





•k^’{K - 




(c) 



( 2 ) 



can be derived. In this formula, 

(a) calculates the number of factors binding to k words 

(b) calculates the number of binding site / non-site patterns with n binding 
sites 

(c) calculates the number of genomes, approximated by w sequences, which 
produce a particular binding pattern with n binding sites given a particular 
factor binding to k words. 

Considering that it is clear that f]' Q. However, is approximately 

proportional to Cl and therefore, the values of k that maximize Cl' for a given n 
are suitable to approximate the locations of the maxima in Cl. Details of this 
approximation can be found in [6]. 



3. Results 

Figure 1 shows results of an analysis based on Cl' for genome length N = 10^^ and a 
fixed binding site word length of L = 10 (hence, K = 4^^^ = 1048576). A considerable 
discrepancy to R^cq = ^froq is observed. The surface plot in Fig. 1 may appear 
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log(Omega’(n,k)) 





n 




Figure 1. Top: plot of logio(r2') for a = N — 10^^ and 
L = 10, middle: coordinates of maximal ft' values for each n 
value (diamonds) and graph of kn = ^ (line), bottom: plot of 
the (i^frcq, ^scq) values calculated as described in Section 2.3 (as- 
terisks) and graph of i?seq = ^freq (line). 
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quite even- levelled, but one should notice the logarithmic scale: The Q' values span 
four million decimal orders of magnitude. The probability for observing k values 
other than the one maximizing Q,' for a given n practically vanishes. 

The bottom plot in Fig. 1 reveals the discrepancies between i^scq and i^frcq 
directly. Here, i^freq values were calculated from the n values shown in the middle 
plot, and i^scq values were calculated from the corresponding k values that maxi- 
mize Vt' . The deviation from i?seq = ^freq is particularly prominent in the case of 
large i^freq values. This finding is especially interesting, as binding site frequencies 
are usually in the order of magnitude of 10“^^ or below, so cases of /^freq ^ 8 are 
biologically most relevant. 



4. Discussion 

While previous analyses focused on the genome space exclusively, our model jointly 
considers both the genome space and the space of protein binding behaviours. 
Based on this integrated state space, our analysis definitively shows that i^^eq = 
i?freq cannot be deduced from principles of information theory. Deviations from 
= Rsix\ can be induced by the distribution of protein binding behaviours. 
With our approach to model binding behaviours, considerable deviations from 
= ^fr(>q are observed for genome and binding site sizes in the order of mag- 
nitude encountered in prokaryotic systems. 

Equality, as well as deviations from it, require additional theory for explana- 
tion. We have developed quantitative estimations of the amount by which i^sc q can 
be displaced from These indicate that the displacement may in reality be 

less pronounced than shown in Fig. 1 because the cardinality of the protein space 
is overestimated by the set of all f. Details of this analysis can be found in [6]. 

In an empirical respect, it would be interesting to revisit the cases in which 
~ ^fi('q was observed, paying particular attention to deviations from equal- 
ity and possible regularities detectable therein. Additional insight into the effects 
which DNA binding proteins have on the information content of their binding 
sites may in the future provide a basis for developing improved methods for the 
detection of biologically significant binding sites, and for the analysis and charac- 
terization of regulatory mechanisms and networks. 

In a theoretical perspective, information theory based studies should lead 
to progress in understanding the evolutionary forces shaping protein-DNA inter- 
actions, including selective constraints on the level of regulatory networks. Our 
work may provide a starting point for integrating regulatory network analyses 
with evolutionary perspectives, e.g. by complementing and extending [7], and for 
theoretical research in evolutionary developmental biology, such as investigating 
the evolution of families of transcription factors that organize development [8]. 
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This chapter focuses on cell-cell and cell-substrate interactions and their con- 
sequence for collective phenomena on the multicellular level. These interactions 
occur in many different systems at almost all developmental stages, in-vitro as 
well as in-vivo. The richness of the observed phenomena asks for suitable methods 
to understand basic principles underlying multicellular organization. 

A very prominent example is cell sorting, where cells from different origin, 
after mixing in culture, reaggregate according to their origin. This was first ob- 
served in in-vitro amphibian neurula cells. In the meantime it has also been ob- 
served in-vivo., namely in Drosophila oocyte localization. One mechanism capable 
of explaining sorting is differential adhesion between cells due to specific cell-cell 
adhesion molecules. If for two cell types the strength of the heterotypic adhesion is 
intermediate, between the strengths of the homotypic adhesion, the cell type with 
the weaker homotypic adhesion encloses the cell type with the stronger homotypic 
adhesion. 

In the contribution by Forgacs and Foty cell sorting in embryonic tissues of 
different origin is studied and it is shown how specific microscopic binding energies 
of adhesion molecules may be linked with the macroscopic surface tension of mul- 
ticellular aggregates. By compression of multicellular aggregates between parallel 
plates it was shown that aggregates of embryonic cells can behave viscoelastically. 
From a quantitative comparison with the Kelvin model it is suggested how the 
macroscopic damping constant can be expressed in terms of microscopic parame- 
ters as the effective adhesion strength of an individual bond. Besides short range 
attraction different ways of long range attraction between cells are possible. These 
involve e.g. chemotaxis, haptotaxis, galvanotaxis, contact guidance, and tensio- 
taxis. 

In the article by Bretschneider a model is suggested, how an individual cell 
can modify the extracellular matrix in order to induce a matrix-mediated long- 
range attraction of other cells by biomechanical means. In this approach the ex- 
tracellular medium is modeled as viscous fluid at very low Reynolds numbers and 
each cell as individual object which is capable of friction-dominated movement. 
Cell-cell attraction stops when the cells get into direct contact. 

The last example shows that cells can modify the way they react to an ex- 
ternal stimulus. Such changes can be correlated e.g. with age or environmental 
stimuli. That is, the same cells may behave differently under different conditions. 
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they are able to change their biophysical and biochemical properties. The rules that 
determine these changes often affect the cytoplasm or are manifested in the genes. 
The latter implies development on the evolutionary timescale. With increasing age 
of an organism genetic changes (differentiations) occur that limit the potential of 
cells to specialize in various directions. In general it is argued that the differentia- 
tions are irreversible. In the contribution by Roder and Loffler a contrasting view 
is presented. It is illustrated in a single-cell based model for hemopoietic stem cell 
organization how the dynamic behavior of the stem cell population size can be 
qualitatively explained if differentiation is assumed to be gradual, reversible and 
controlled by the cell microenvironment. 

Microorganisms offer particularly good opportunities to study the dynamics 
of interacting cell sytems. An example is rippling of myxobacteria before fruiting 
body formation takes place. In the contribution by Borner, Deutsch and Bar it is 
suggested that cells turn their direction of movement as they meet, and then be- 
come insensitive for further contacts for the duration of a refractory time period. A 
cellular automaton model is presented and it is studied under which conditions the 
experimentally found wavelengths relate to the ones found in the model. Aspects 
of spatio-temporal pattern formation in multicellular organisms are discussed from 
a more concept ional point of view in chapter 6. 

Cell-environment interactions, environment being either the matrix, chemical 
signals, or cells of the same or of different types are a central issue also in tumor 
growth where such interaction may determine e.g. the tumor morphology and 
growth kinetics (compare chapter 7). 

Together the articles represent a combination of different aspects in the un- 
derstanding of cellular organization. Cells are capable of short time responses (e.g. 
change of motion), long time modifications (differentiation) and reproduction. All 
presented approaches consider the variance in individual cell behavior. 
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1. Introduction 

An emerging theme in developmental biology is the interplay of genetie and generic 
(i.e. physical) mechanisms [2,15]. As the embryo grows its cells progressively or- 
ganize into tissues with vastly differing physical properties extending from liquid 
(e.g. blood) to solid (e.g. bone). In more biological terms, different tissues are 
the consequence of cell differentiation. It is the particular differentiated state that 
gives rise to the specific physical-material properties of the cell. On the other hand 
the emerging physical characteristics feed back into development and restrict its 
outcomes [2]. The eml)ryonic state is particularly convenient for the study of this 
interplay because at this stage the organism is in a state of constant change in- 
volving active cell movement, it has not yet rigidified into more or less permanent 
structures and thus the effects of physical processes are more conspicuous. 

One of the most obvious manifestations of the interplay of genetic and phys- 
ical processes is cell sorting, a morphogenetic phenomenon which is key to the 
ability of diverse cells to segregate to form organized tissues and organs. The re- 
sult of sorting of tissues of different origin (as consequence of differential gene 
expression) is the appearance of well-defined tissue boundaries, which define phys- 
ical formation or pattern. For example, cells of the prospective nervous system 
must differentiate, segregate from prospective skin cells, and move inside the em- 
bryo. Failure to do so results in birth defects (e.g. Spina bifida). Thus incorrect 
physical arrangement of tissues leads to disease. Holtfreter and colleagues shed 
light on this problem [26], dissociating tissues such as nervous system and skin 
primordia into single cells, then mixing them. Although all the cells stuck to each 
other, they did so to different degrees. Skin and neural progenitor cells found their 
respective counterparts in the cell mix to form sorted aggregates, with skin cells 
surrounding a central sphere of neural cells. Analogously, in an initially random 
mixture of endodermal and ectodermal Hydra cells, endodermal cells sort to the 
center of the aggregate and ectodermal cells surround them [17,25]. 

Any pair of tissues, not only those that contact each other in normal de- 
velopment, sort, provided one is more cohesive than the other [21]. Steinberg’s 
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Differential Adhesion Hypothesis (DAH) (1962), postulated, in analogy with im- 
miscible liquids, that different cells adhere to each other with different strengths 
and that the final sorted configuration corresponds to the minimum of interfa- 
cial and surface free energies. Our extensive in vitro studies [6, 7] have confirmed 
the DAH postulates. Computer simulations based on the DAH and the statistical 
mechanics of cohesion and adhesion reproduce the equilibrium configurations of 
sorted tissues [10,14,16]. Recent in vivo experiments [11,12] have corroborated the 
DAH model of sorting. Since Steinberg’s original paper (1962), experiments have 
clarified the molecular basis of DAH. DAH results from the differential expression 
of cell adhesion molecules (CAMs) in cells of different origins [8]. All CAMs have 
specific structures and specialized morphogenetic roles [3,4]. The present work 
provides quantitative experimental evidence for DAH. It shows how a biophysical 
approach based on DAH to study embryonic tissue properties leads to biologically 
relevant quantitative information on molecular parameters, such as strength of cell 
adhesion. 



B 




Figure 1. The schematic representation of the special surface 
tensiometer. See text for details. 
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2. Tissue Surface Tension 

Steinberg’s model implies that the cells, which sort to internal positions must be 
more cohesive. In fluids, surface or interfacial tension distinguishes the strength 
of cohesion. In order to quantitatively evaluate the postulates of DAH we thus 
measured the surface tensions (i.e. the interfacial tension between living tissues 
and the surrounding tissue culture medium) of a number of embryonic cell popu- 
lations [6,7] and tissues composed of genetically transformed cells using a specially 
designed compression plate surface tensiometer (see Fig. 1) 

Spherical aggregates (prepared by the method explained in Foty et al. [7] are 
compressed between the parallel plates in the inner chamber (IC). A circulating 
water bath feeds the outer chamber (OC) with 37C water. The upper compres- 
sion plate (UCP) hangs from the arm (B) of an electrobalance (through a nickel- 
chromium wire, NCW). Raising the lower compression plate (LCP) by turning 
the lower assembly (LA) compresses the aggregate (A) between the two plates. 
Two video cameras (placed at 90 angles relative to each other), one of which is 
attached to a horizontally positioned microscope monitor the shape of the aggre- 
gate. A computer equipped with a National Instruments Multifunction I/O card 
and Labview software for data acquisition and a composite color PCI frame grab- 
ber continuously records changes in the force exerted by the aggregate upon the 
UCP (equal to the changes in the latter’s apparent weight; see Fig. 4) as well 
as the shape of the aggregate. From the equilibrium value of the force and the 
geometric properties of the aggregate the surface tension is obtained by standard 
mathematical methods (i.e. Laplaces equation), as explained in Foty et al. [7]. 

3. Surface Tension and Equilibrium Sorting Pattern 

The surface tension of ordinary liquids contains information on the relative mag- 
nitudes of the cohesive energy between the molecules (the strength of attraction 
between the molecules of the liquid) and the adhesive energy (the strength of at- 
traction between the liquid molecules and those in the surrounding medium, e.g. 
air). In case of living tissues tested in the tensiometer the affinity of binding be- 
tween two cells in the aggregate (the strength of cell-cell adhesion as used in the 
biological literature) is much stronger than the association between the aggregate 
and the surrounding tissue culture medium. Thus, tissue surface tension reveals 
information on tissue cohesivity. Figure 2 shows equilibrium sorting patterns (in 
order of decreasing surface tension) obtained through the random mixing of pairs 
of embryonic chicken tissues. 

On the right of Figure 2 are images generated through cell sorting, when 
adjacent tissues in the surface tension hierarchy are combined and allowed to 
rearrange in vitro. Images are 60 fim optical (i.e. confocal) sections through the 
resulting, flxed structures. Cells from the two tissue sources were stained with 
contrasting fluorescent markers here assigned false colors to represent flve tissues. 
Neural retina (red) envelops liver (blue), which in turn envelops heart ventricle 
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Figure 2. Equilibrium sorting patterns in mixtures of embryonic 
chicken tissues. The surface tension of each tissue is indicated on 
the right in units of dyne/cm. (For colored picture see color plate 

7 )- 

(yellow), which in turn envelops pigmented retina (red), which envelopes limb bud 
mesoderm (green). In each case the less cohesive tissue envelops the more cohesive 
tissue as indicated by the values of the measured surface tensions listed on the left 
in units of dyne/cm [7]. These sorting experiments show the perfect correlation 
between surface tension and equilibrium sorting pattern: more cohesive tissues are 
surrounded by less cohesive tissues in accordance with the postulate of DAH. 



4. Surface Tension and the Strength of Cell Adhesion 

The adhesion apparatus of the embryonic chicken cells discussed above is not 
fully known (they most probably express various cadherins). In order to relate 
the numerical values of the measured tensions to the strength of cell adhesion, a 
biologically relevant quantity, the adhesion apparatus also has to be quantified. We 
have achieved this by the use of transformed mouse L fibroblasts, which in their 
native state do not express any cell adhesion molecules. We have performed sorting 
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experiments and surface tension measurements with genetically manipulated L 
cells that express known quantities of N cadherin molecules. Cadherins form a 
vast family of Ca dependent CAMs [13,22-24]. There are many classes within the 
family (E, P, N, etc.), with distinct biological specificity and expressed mostly by 
cells in distinct differentiated state (e.g. E-cadherin by epithelial cells). Cadherins 
form mostly homophilic bonds: cadherins within the same class (e.g. E-E) adhere 
strongly, whereas cadherins from different classes (e.g. E-P) bind weakly [13]. 
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Figure 3. Sorting in genetically transformed cell populations 
with controlled number of cell adhesion molecules. See text for 
details. The linear size of the aggregate is around 200 /iin. (For 
colored picture see color plate 7). 

On the left of Fig. 3 is a random initial mixture of two embryonic L cell pop- 
ulations transfected with N-cadherins (the green cells express 25% more cadherins 
than the red cells, yellow is the consequence of color mixing; staining is done as in 
Fig. 2). To the right is the final equilibrium configuration 20 h after mixing. Thus 
a relatively small difference in adhesive strength can drive sorting and establish 
tissue boundaries. We expect the tissues whose cells express more cadherins to 
have higher surface tension. Aggregates prepared of the green cells in Fig. 3 in- 
deed have higher tension than those made from the red cells. Such measurements 
allow to determine the effective strength of cadherin binding under near physiolog- 
ical conditions. (Effective refers to the fact that the collective, zipper-like coupling 
of many cadherins on apposing cells [18] could make the binding of n cadherin 
pairs in the zipper stronger than n times the strength of a single cadherin bond, 
which may even not exist under physiological conditions.) As results by Foty and 
Steinberg (to be published) indicate surface tension is proportional to the num- 
ber of cadherins expressed by the cell. Simple dimensional analysis also leads to 
cr = aJN (cr-surface tension, J-effective strength of binding, Wcadherin surface 
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density, a-constant). (Cadherin surface densities are determined using fluorescent 
cell sorters.) A criticism one may have against such analysis is that genetically 
manipulated cells do not perfectly reflect true physiological conditions. However, 
comparison of Figs. 2 and 3 shows that the flnal equilibrium sorting conflgurations 
of wild type and transformed cells are identical. In addition, the quantitative infor- 
mation on the number of CAMs expressed by the transformed cells allows analysis 
of sorting that is not available for chicken embryonic tissues. We believe that the 
combination of studies on chicken cells and transformed cells leads to biologically 
relevant information on cell adhesion. 



5. Viscoelasticity of Living Tissues 

Living tissues are clearly more complex than ordinary liquids. Even though the 
notion of tissue surface tension allows to establish equilibrium sorting hierarchies 
in terms of tissue cohesivity, it reveals nothing on how this equilibrium is reached. 
Figure 4 shows the typical relaxation of the compressive force in a parallel plate 
experiment. It is the equilibrium value of the force that determines the surface 
tension through Laplace’s equation [7]. The approach to this equilibrium as shown 
in the figure is typical for viscoelastic materials. We have modeled the entire relax- 
ation process using a generalized Kelvin model of viscoelasticity [9] and were able 
to determine the viscoelaistic parameters of the chicken embryonic tissues shown 
in Fig. 2 [5]. Similar experiments are under way for the genetically transformed 
L cells. The curve in Fig. 4 can be fitted perfectly (R^ =0.999) with a double ex- 
ponential. Thus the tissue possesses two dominant characteristic time constants 
(the others are negligible), indicating a bimodal relaxation process. Indeed the 
initial decay of the compressive force is typical for a fast elastic response, whereas 
the later segment is typical for a long and slow viscous response. The two time 
constants differ by more than an order of magnitude. 

The Kelvin model represents the viscoelastic system in terms of friction con- 
stants (i.e. viscosities) and elastic constants and the analysis of the data in Fig. 
4 yields the values of these parameters. Simple dimensional analysis relates the 
viscoelastic parameters to biologically important quantities. The friction constant 
/i of the tissue has its origin in the displacement of individual cells in an aggregate 
when the latter is forced to change its shape (as in a compression experiment), ji 
relates force to velocity, thus its units are Nsec/m. It has been shown that cells 
within a compressed aggregate recover their uncompressed cuboidal shape [5] by 
the time the system reaches equilibrium. Since the compressed aggregate (in the 
final equilibrium state) has a larger surface area than the precompressed aggregate 
(which is spherical) some cells from the interior of the aggregate must have moved 
to the surface. This highly viscous motion requires that cells brake and reform 
their adhesive contacts with their neighbors. It is plausible that 1) the stronger 
the bonds and the longer they live the more difficult it is to brake them and 2) the 
more such bonds there are per unit cell surface the more hindered the motion will 
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Figure 4. Characteristic force relaxation curve during the com- 
pression of a chicken embryonic heart cell aggregate. The smooth 
curve overlaid on the data acquired with the electrobalance and 
appropriate software represents a double exponential fit as a result 
of modeling. 

be. Thus, /i = bJNr, with r being the characteristic lifetime of an adhesive bond 
and b a constant. In light of the units of J, N and r (respectively Joule, l/m‘^ and 
sec), this simple dimensional analysis, together with the one presented for a in 
principle determines J and r, provided N is known (which it is for the genetically 
transformed cells). 

6. Summary 

A number of observations indicate that embryonic tissues mimic the behavior of 
immiscible liquids or fluids [20]. Irregularly shaped embryonic tissue fragments 
round up toward a spherical shape. When embryonic tissue masses of different 
origin are in contact one spreads on the surface of the other. When embryonic 
cells that belong to two distinct populations are randomly intermixed one sorts 
out from the other and the final equilibrium pattern is a sphere inside a sphere. 
Measured values of tissue surface tensions correlate with the hierarchical ranking 
of such configurations (see Fig. 2). The kinetics of sorting proceeds via the coa- 
lescence of small islands of cells into larger ones. This process can be described 
quantitatively by standard theories of nucleation in liquids. Steinbergs differential 
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adhesion hypothesis connects the physical interpretation of these phenomena in 
terms of liquid properties with the molecular biological entities underlying such 
behavior (i.e. cell adhesion molecules). The present work uses this dichotomy to 
deduce biologically relevant information from physical characteristics. In partic- 
ular we outlined an approach to derive the effective strength of binding between 
cadherins based on biophysical measurements of tissues surface tension and vis- 
coelastic properties. An example where such information may be of great value is 
tumor metastasis. A primary tumor (e.g. carcinoma), provided it is diagnosed in 
time can in principle be removed surgically. Surgery implies risk and if possible 
is to be avoided. With the progression of the carcinoma adhesion between the 
constituent cells is gradually downregulated [1] and concomitantly cell motility in- 
creases. These factors enhance the metastatic potential of the tumor. A systematic 
quantitative monitoring of cell adhesion, in the advancing cancer along the lines 
described here may establish a criterion for the surgical intervention. 
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1. Introduction 

It has been known for a long time that cells like fibroblasts on collagen can con- 
tract protein gels and align the constituent protein fibres [9]. In addition, the cells 
preferentially migrate along the aligned fibres. This feedback is important in re- 
modelling of tissues under mechanical stress like in bone development (reviewed in 
[8]). Using artificial elastic substrates (Fig. 1) which do not exhibit fibre alignment 
and thus exclude guidance phenomena it was shown that cells also react to changes 
in substrate tension and rigidity (also referred to as ’’durotaxis” by Lo et al. [5]). 




Figure 1. Traction forces cause wrinkling. Human epidermal 
keratinocytes on a highly compliant silicone substrate. Wrinkles 
in between the cells indicate that forces are transmitted over quite 
a distance (cell size approx. 50 /im 

Detailed studies revealed that there is proportional strengthening of cy- 
toskeletal linkages (reinforcement) in response to restraining forces imposed by 
pulling on beads coupled to cell surface integrin receptors using a laser trap [3, 4]. 
This suggests that reinforcement might be involved in tensiotaxis. The present ar- 
ticle describes a hybrid approach to reflect the discrete nature and small number 
of highly specific cell-substrate binding sites (opposed to continuous mechanical 
models, reviewed in [6]) which allows to test this assumption. 
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2. Model 

The model consists of three components: 1) the continuous substrate (matrix) 
which is a two-dimensional compressible viscoelastic fluid, 2) binding sites which 
are integral parts of the extracellular matrix and move by convection, and 3) 
discrete cells which can attach to the binding sites and exert traction forces onto 
the substrate. The net traction force results in cell motion. We assume that the 
cells have a default state where they attach to the matrix and pull towards the 
cell center, which is true for non-moving unpolarized cells. These forces result 
in deformation of the underlying matrix and in the presence of neighbour cells 
the tress held a cell perceives can be quite complex. Since our hypothesis is that 
traction forces depend on the stress in the matrix asymmetries in the stress field 
give rise to an asymmetric force distribution underneath the cells, a prerequisite 
for cell motion, and grow in time. 

2.1. Matrix 

The matrix is described by a modified Stokes equation with an additional elastic 
term. Although the model chosen here only accounts for simple shear flow and 
does not consider flow-rate dependent stress, fluid anisotropies or memory effects 
of complex biogels [7] it captures all of the relevant phenomenological features 
required for an elementary and generic model. 

-kVjp - Po) +CV (V > V) +/UAV -$pV =0 



pressure, dilation vise., shear viscosity, friction, cell forces 

( 1 ) 

p is the matrix density with po = 1 being the average density, V is the ma- 
trix velocity and F.s the sum of forces exerted by the cells on the matrix volume 
element s {s runs through all elements of the triangulation; l/k\ compressibility. 
Lame coefficients (here ( = was used), friction between the matrix and 
the underlying surface). In brief the terms in this force balance equation have the 
following meaning: 

pressure: the force which resists stretching and compacting of the fluid and is 
responsible for relaxation when there are no external forces; dilation (volume) vis- 
cosity: friction due to volume expansion (actually this term vanishes for true 2D 
systems); shear viscosity: internal friction of the matrix; friction: in experiments a 
polymerized matrix usually is created on top of unpolymerized monomer solution 
(e.g. silicone oil) - $ describes the friction at this interface; cell forces: external 
forces imposed by the cells on top. 

Mass conservation within the domain bounded by F (n: normal onto F) is 
ensured by Eq. (2). 



+ V • (pV) = 0 



(2) 
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2.2. Numerical Solutions 

We impose the following boundary conditions: p|r = po- A FEM approach with 
a modified Galerkin method is used (linear base functions). For ID computations 
we apply a pseudo- ID- triangulation (Fig. 2). Since we can validate the numerical 
results using analytical solutions in ID (see appendix) the numerical schemes thus 
are known to be correct when extended to 2D. The density p is constant within 
the elements, the velocity V exists on the nodes of the triangulation. The stiffness 
matrix for V is solved by a optimised conjugate-gradient method. Integration of 
Eq. (2) is based on explicit Euler adaptive time stepping using a flux-corrected 
transport method. 
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Figure 2. Pseudo- ID- triangulation with model cell. Open circles 
denote binding sites, the rectangular box represents a model cell. 
In this configuration there are no fluxes in the y-direction. (For 
colored picture see color plate 8 ). 



2.3. Binding Sites 

Randomly distributed binding sites are integral parts of the substrate and move 
by convection. They are discrete points where forces generated inside a cell are 
transmitted to the substrate. It turns out that the accurate treatment of force 
transmission and the involved discontinuities poses the main challenge in the nu- 
merical simulations (see appendix). So far the distribution of binding sites is the 
only stochcistic element in the model. 

2.4. Cells 

Cells are abstracted as discrete entities having a specific length (ID, see Fig. 2) 
or a circular shape (2D). Wherever a cell covers a binding site it attaches and 
pulls with the force F- towards the cell centre. The net force for K binding sites 
generated internally results in translocation of the cell. With 7 being a parameter 
describing the internal friction of all cytoskeletal elements it’s velocity is 

given by 

K 

y^F'+7V,,„=0 (3) 

i=\ 

K=K(t) stochastically varies in time as the cell explores new binding sites or looses 
old ones under its cell body while moving. 
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2.5. Reinforcement 

The forces acting on a single binding site are assumed to be modulated by the 
stress in the matrix = f{a) * F^, where 




with axx^ ^yy being the components of the transformed stress tensor describing 
tension in the x and y direction if positive. F^ can be considered as the unmodu- 
lated force applied to a rigid surface. Based on experimental results [3, 4] a simple 
linear dependency was considered (see Fig. 3A). As a result neighbour cells develop 
higher forces at the side where they face each other since the matrix in between 
the cells is under tension (Fig. 3B). In turn this results in motion of the cells 
towards each other based on stress mediated mechanical cell-cell interaction. Fig- 
ure 4 shows the stochastic effects of binding site distribution for a spontaneously 
moving cell. 




Figure 3. Force-stress relation. A: The force on a single binding 
site depends linearly on the stress in the underlying matrix. B: 
Two neighbour cells pull stronger on the side where they face 
each other (black arrows: forces on each single binding side, arrows 
below the cell: net force for the left and right side, color of the 
matrix: Va^). (For colored picture see color plate 8). 
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Figure 4. Spontaneous cell motion in ID. A: Plot of the matrix 
density {p — po) for the cell given below. To visualise the stochas- 
tic effect of the distribution of binding sites the cell is blocked 
in movement. As a result the forces are imbalanced and it pulls 
harder on the right site which causes an asymmetric density pro- 
file (Vertical lines in the density plot indicate the cell boundaries 
for the given cell below, arrows denote the net forces for the left 
and right side, color: matrix density). B: Inclusion of cell motion 
counter-balances the asymmetry. (For colored picture see color 
plate 9). 



3. Results and Discussion 

The main purpose of developing the model was to investigate, whether reinforce- 
ment of cytoskeleton-matrix linkages can account for long range attraction of cells 
mediated by a flexible substrate. Figure 5 shows that the model succeeds in sim- 
ulating tensiotaxis mediated attraction of two neighbour cells in ID. Although 
it is noteworthy that in the model both cells move as a single unit once they 
have contact this is probably not significant since real cell-cell contacts involve 
numerous other regulatory mechanisms like contact inhibition. The corresponding 
two-dimensional situation is depicted in Fig. 6 A. Figure 6B demonstrates that the 
same principle also holds for more cells. Again it is stressed, that the model in its 
present state is only valid to describe the initial phase of attraction, i.e. overlapping 
of cells is not prevented. 

The results support the idea that reinforcement of cytoskeleton-matrix link- 
ages can account for long range attraction of cells mediated by a flexible substrate 
and comprises a new type of cell-cell interaction module. Incorporation in the 
framework of existing models which in addition include short range mechanical in- 
teractions [1,2] will allow to investigate its possible consequences for cooperative 
effects in tissue formation. 
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Figure 5. Two cells attracting each other. Time series for two 
cells. T=0.02: The matrix underneath the cells is compacted. 
T=0-2: The cells start moving towards each other. T=0.4: Cells 
contact each other and move together (T=0.8). This is because 
instabilities in the force distribution are more likely to grow for 
two cells in close contact than for a single one. 
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Figure 6. Tensiotaxis mediated aggregation in 2D. A: Attraction 
of two cells in 2D. Cells now are represented by circles with a 
lamella region and traction forces (black arrows, only shown for 
the first time step) point to the cell centre. The colour of the 
matrix codes the stress. (For colored picture see color plate 9). 
Maximum tension can be found in between cells and like in ID 
cells move towards each other. B: Same situation as in A, but now 
with six cells. To exclude effects caused by cells coming to close 
to the domain boundary, a larger multi resolution grid is used. 
Black arrows here denote the matrix velocity. 
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APPENDIX 

We briefly sketch an analytical solution in ID for the special case ^ = 0. Eqs. (1) 
and (2) are simplified and a single point force at location ^ with 0 < <^ < x^nax is 
cosidered to pull to the right (see Fig. 7). 

dx{T' dxV k{p() - p)) = -fp^5^ 

dtpFdApV) = 0 

at the left boundary V(a:=()) is set to 0, and the stress 

^(x = Xr„aa:) '= d^V + fc(po ~ /») = 0. 



(A-1) 

(A-2) 
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Because of cr = const on both intervals and the jump condition 

together with CF{x=Xrr^ax) 0 we have for the left interval (0 < x < ^) 

= -fpiO (A-3) 



and 

d.V=-^{po-p) + ^,piO (A-4) 

12 fl 

Now, we solve Eq. (A-2) for the initial condition p = po at time ^ = 0 while 
p(^, x) = p{t) and we have: 



P = 



k + f 



P‘ 



-,Po]P 



for which the solution is 

p{i) = 



A) 






kpo 



(A-5) 

(A-6) 



from eq. (A-4) follows for the velocity 



'^(^>2;) = —,Pn 



P' l + i(l 









with 0 < X < (^ 



(A-7) 



at equilibrium poo{x) = and the force /poo equals the elastic force /c(po — p). 

Also for the right interval ^ < x < x,nax analytical solutions can be derived (by 
the methods of characteristics, not shown). Fig. 7 shows the stable density profile 
poo (Po = I-, / = 1-,^ = 9-, plotted is p — po) and an transient velocity profile. 




Figure 7. Effects of a force acting on a single binding site. Left: 
Density profile at equilibrium for a single point force acting at 
location x=0.5 and pulling to the right. Right: Corresponding 
transient velocity profile. 
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1. Introduction 

Haematopoietic stem cells (HSC) are characterised by a flexible potential to pro- 
liferate, to produce large numbers of differentiated cells, to self-renew their own 
population, and to regenerate the functional tissue after injury [7, 8]. Although 
these capabilities are well defined, the mechanisms underlying the regulation and 
control of stem cell proliferation, differentiation, and self-renewal are still unknown. 
In contrast to the high experimental effort to investigate these processes only little 
progress has been made to establish a theoretical framework of stem cell organi- 
sation. 

In the following we present the basic idea of a formal model providing a com- 
prehensive concept for understanding the dynamics of the HSC system. The model 
fulfills all criteria defining general tissue stem cells [8] and consistently explains 
a broad variety of experimentally observed phenomena including heterogeneity 
of stem cells with respect to phenotypic markers, repopulating ability or cycling 
activity, microenvironmental dependence of repopulating potential, plasticity and 
reversibility of geno- and phenotypic traits, and clonal fluctuation and competi- 
tion processes. The model is able to describe single cell as well as cell population 
behaviour on time scales from days to years using a single cell based Monte-Carlo 
approach. 



2. Model Description 

The basic idea of the concept is the ability of individual cells to reversibly change 
their properties respectively functionalities, within a range of possible options, 
depending on the influences of the local growth-environment. Switching between 
growth-environments introduces variability in the clonal development. We call this 
reversibility of geno- or phenotypic traits within- tissue plasticity. 
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2.1. Model Assumptions 

To formalise the model we make the following assumptions: (Al) We assume two 
different growth-environments GE—Adind GE—Q. (A2) Each cell is characterised 
by its cell cycle status c, a property a, which describes the affinity of the cell to 
reside in GE — A , and its membership to either GE — A or GE — Q . (A3) Residing 
in GE — A a, cell is assumed to be non-proliferating. In contrast cells in GE — 
proliferate with average turnover time Tc. (A4) Whereas cells in GE — Ahave 
the propensity to increase affinity a, cells in GE — Q tend to decrease a. (A5) 
Cells can change from GE — Ato GE — Q and vice versa with transition intensities 
(probabilities per time interval) a and cu, which depend on the actual affinity a 
and the cell numbers in the system. This implies a competition of cells in both 
growth-environments in a stochastic sense. (A6) The cellular development with 
respect to property a is reversible. Only if a has become critically small the cell 
is considered to have lost the potential to stick to GE — A. Such cells we call 
differentiated. Figure 1 schematically summarises these assumptions. 
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Figure 1. Schematic representation of the model-concept. 



Referring to the haematopoietic system we like to think of two growth-envi- 
ronments in the bone marrow where stem cells can either be attached to {GE—A ) 
or detached from {GE — Q) specific stroma components, resulting in switches be- 
tween different patterns of gene expression and activation of signal transduction 
pathways. Differences in the cycling behaviour of stem cell subsets and the experi- 
mental observation that stroma contact is needed to preserve repopulating ability 
support this interpretation. 
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2.2. Mathematical Representation 

Each cell in the system is described by: (i) the attachment affinity a G <^max], 

(ii) the position in the cell cycle c with 0 < c < Tc, and (iii) an indicator m G {A, ^2} 
specifying the actual growth-environment. Therefore, the state of an individual 
cell 2 : at time t can be written as the following vector: z(t) = {a{t) , c{t) , m{t)Y • 
Simulations are realised by a synchronous update of the state vectors of all cells 
at discrete time steps according to the following rules: 

(1) Transitions of cells between GE — A^nd GE — Q.3xe described by 



P{m{t + At) = A I m{t) = ft) = a = | 
and 



a{t) 

■ Jet 

^max 



0 



for c{t) < gi 



else 



P{m{t + At) = n I m(t) = A) = uj = 

a[t) 

Herein g\ describes the length of cell cycle phase G\. Furthermore, function 
models the capacity of GE—A to assimilate cells. This capacity decreases with rising 
numbers of cells in GE—A representing a limited resource of binding sites. Similarly 
represents the cell production demand with low numbers of proliferating cells 
inducing an activation of dormant cells into cycle. Because no detailed knowledge 
is available about the underlying biological and chemical mechanisms of these 
attachment / detachment processes we modeled them by a general class of sigmoid 
functions to allow for flexibility in the analysis of the system properties. Both 
functions are modulated by the individual attachment affinity a in the sense that 
cells with high a are more likely to change to GE- A whereas cells with low a tend 
to reside in GE — ft (see examples in Fig. 2). 




(a) 



(b) 



Figure 2. Transition functions a and u for different values of a. 
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(2) If cells do not undertake transitions from one growth-environment to the other 
in At they develop inside the actual growth-environment according to the following 
deterministic rules: 



m{t) = Q : 
d(^t “h At^ 

c{t -h A^) 
m{t) = A : 

Oj{t At^ 
c{t + A^) 



J a{t)/d if a{t) / d > dmin ^ y^ith d>l 

^ 0 else 

f c(t) -h A^ if eft) At < Tc 
0 else 

f d(t) • r if d{t) • r < dmax , with r > 1 
\ (^max else 



Cell division at c{t) > Tc is realized by a replacement of the mother cell by two 
daughter cells inheriting the actual state vector. If cells have reached a = 0, they 
are assumed to be founders of a maturing clone producing differentiated cells. 



3. Simulation Results 

A detailed description of all different experimental observations which have been 
simulated (see [9]) is beyond the scope of this article and we only give a short 
overview of phenonerna classes which are covered. 

The model is able to produce qualitatively different growth scenarios includ- 
ing the repopulation of the system from one cell and to compensate repeated 
damages. Steady state situations (with respect to average cell numbers), are dy- 
namically stable. If looking at different regions of the d — r— parameter space 
qualitatively different system behaviours emerge. Whereas small d produces oscil- 
lations, with the limiting case of indefinite growth for d ^ 0, large d result in a 
system exhaustion (see Fig. 3). 

If investigating the contribution of two differently but neutrally marked cell 
populations to blood production, fluctuations have been observed [1]. The obser- 
vation that these are especially pronounced shortly after transplantation can be 
reproduced by our simulations. If the two cell populations have different cell kinetic 
properties, the fluctuating behaviour can turn into a competition with a potential 
replacement of one cell type by the other (e.g. normal and malignant cells in hu- 
man leukaemias [3] or kinetically different cell types in animal chimeras [11]). The 
assumption of different transition characteristics (/^, f^^) or differentiation (d) and 
regeneration (r) parameters allows a qualitative explanation of such phenomena. 

The used single cell based model also offers the possibility to investigate 
competition processes on the individual clone level. Approximating experimen- 
tal sampling and measurement strategies, our model predicts results on clonality 
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Figure 3. (a) Regions of qualitatively different system behaviour 
in d-r-parameter space, (b) Representative realisations of the sim- 
ulation model according to these regions. 



conversion in viral gene-marking experiments. Depending on the scenario, oligo- 
clonality with large long-lived clones [6] as well as polyclonal situations with many 
short-lived clones [4] are detectable. 

Differences in stem cell support between stroma cell lines and the effect of 
direct stem cell stroma contact on in vitro preservation of repopulating ability 
have been demonstrated [12]. The assumption of differences in achievable maxi- 
mum values for the attachment affinity aj^ax (to model differences in stem cell 
supporting quality of the stroma) or an inhibition of direct contact by setting the 
transition intensity o = 0 can explain the differences in the repopulating ability 
of stem cells cultured in different in vitro settings. 

The model is able to reproduce the described variability in colony growth 
assays [10, 5] without assuming a predetermined developmental hierarchy of the 
stem cells. Furthermore, it is well documented that the majority of stem cells is 
non-proliferating under steady state conditions. However, most of the primitive 
cells can be labeled in S-phase [2]. The model is consistent with these data. The 
ongoing activation/deactivation process allows for a sequential S-phase labelling 
of the entire stem cell pool with a high proportion of dormant cells at any time 
point. 

4. Discussion 

The presented model-concept of haematopoietic stem cell organisation focuses on 
regulation and control of self-renewal and reconstituting ability. We suggest within- 
tissue plasticity as an important mechanism with the growth-environment having 
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impact on the actual expression of potential cell properties. In contrast to pre- 
viously proposed models, our concept neither requires assumptions on explicit 
sternness properties nor on a predetermined, unidirectional developmental hierar- 
chy of stem cells. Nevertheless, the system behaviour is fully consistent with the 
definition of tissue stem cells fulfilling all functional assay criteria. These findings 
permit a fundamentally new perspective on stem cell development as a tissue de- 
pendent self-organising process. Instead of focusing purely on cellular properties 
the model suggests to consider the system of cell - environment interaction as es- 
sential to dynamically stabilise distributions of multivalent stem cell populations 
which can flexibly respond to demands. 

We are aware of the fact that our model of haematopoietic stem cell develop- 
ment is simplistic. Feedback mechanisms from more mature cell stages are ignored 
at the moment. The same holds for the spatial component of the bone marrow 
matrix. Furthermore, we believe that lineage commitment processes, which are 
not covered yet, could be interpreted using similar conceptual assumptions on 
within-tissue plasticity and environment dependent fluctuations. A further point 
is the simplification to two growth-environments as well as the description of cells 
by only two functional properties. Microenvironmental influences as well as cel- 
lular properties are certainly high dimensional. However, we aimed to start with 
to most simple model which is still able to meet the criteria of the definition of 
tissue stem cells and, at the same time, covers a broad spectrum of qualitatively 
different phenomena. 

This work demonstrates that the concept of within-tissue plasticity can pro- 
vide a rather comprehensive general class of models. We believe that it can be 
applied to other stem cell system as well as to the description of recently discov- 
ered tissue plasticity phenomena of adult stem cells. 



References 

[1] J.L. Abkowitz, S.N. Catlin, and P. Guttorp. Evidence that hematopoiesis may be a 
stochastic process in vivo. Nat. Med., 2 (1996), 190-197. 

[2] G.B. Bradford, B. Williams, R. Rossi, and I. Bertoncello. Quiescence, cycling, and 
turnover in the primitive hematopoietic stem cell compartment. Exp. Hematol., 25 
(1997), 445-453. 

[3] A.M. Carella, F. Frassoni, J. Melo, C. Sawyers, C. Eaves, A. Eaves, J. Apperley, S. 
Tura, R. Hehlmann, J. Reiffers, E. Lerma, and J. Goldman. New insights in biol- 
ogy and current therapeutic options for patients with chronic myelogenous leukemia. 
Haematologica, 82 (1997), 478-495. 

[4] N.J. Drize, J.R. Keller, and J.L. Chertkov. Local clonal analysis of the hematopoietic 
system shows that multiple small short-living clones maintain life-long hematopoiesis 
in reconstituted mice. Blood, 88 (1996), 2927-2938. 

[5] Q.L. Hao, F.T. Thiemann, D. Petersen, E.M. Smogorzewska, and G.M. Crooks. Ex- 
tended long-term culture reveals a highly quiescent and primitive human hematopoi- 
etic progenitor population. Blood, 88 (1996), 3306-3313, . 




Simulation of Haematopoietic Stem Cell Organisation 



293 



[6] C.T. Jordan and I.R. Lemischka. Clonal and systemic analysis of long-term 
hematopoiesis in the mouse. Genes Dev., 4 (1990), 220-232. 

[7] M. Loeffler and C.S. Potten. Stem cells and cellular pedigrees - a conceptual in- 
troduction, Stem Cells, Edited by C.S. Potten, (Academic Press, Cambridge) 1997, 
1-27. 

[8] M. Loeffler and I. Roeder. Tissue stem cells: Definition, plasticity, heterogeneity, self- 
organization and models - a conceptual approach. Cells Tissues Organs, 171 (2002), 
8-26. 

[9] I. Roeder and M. Loeffler. A novel dynamic model of hematopoietic stem cell orga- 
nization based on the concept of within-tissue plasticity Exp. Hematol. 30, (2002), 
853-861. 

[10] J.E. Till, E.A. McCulloch, and L. Siminovitch. A stochastic model of stem cell pro- 
liferation, based on the growth of spleen colony-forming cells. Proc. Natl. Acad. Sci., 
51 (1964), 29-36. 

[11] G. Van Zant, K. Scott-Micus, B.P. Thompson, R.A. Fleischman, and S. Perkins. Stem 
cell quiescence/ activation is reversible by serial transplantation and is independent 
of stromal cell genotype in mouse aggregation chimeras. Exp. Hematol., 20 (1992), 
470-475. 

[12] J. Wineman, K. Moore, I. Lemischka, and C. Muller-Sieburg. Functional heterogene- 
ity of the hematopoietic microenvironment: rare stromal elements maintain long-term 
repopulating stem cells. Blood, 87 (1996), 4082 4090. 




Function and Regulation of Cellular Systems: 

Experiments and Models, 295-301 

ed. by A. Deutsch, M. Falcke, J. Howard, W. Zimmermann 
© 2004 Birkhauser Verlag Basel/Switzerland 



Pattern Formation in an Interacting Cell 
System: Rippling in Myxobacterial Aggregates 

Uwe Borner^ Andreas Deutsch^ and Markus Bar^ 

^Max Planck Institute for Physics of Complex Systems, 

Nothnitzer Str. 38, D-01187 Dresden, Germany 
^Center for High Performance Computing, TU Dresden, 

Zellescher Weg 12, 01069 Dresden, Germany 



1. Introduction 

It is usual to divide organisms into two basic categories: unicellular and multicel- 
lular. However, looking at unicellular organisms such as bacteria and amoebae one 
can see their clear tendency to form multicellular structures, colonies and fruiting 
bodies [1-7]. Here we focus on pattern formation of Myxobacteria [8, 9]. Myxobac- 
teria are rod-shaped cells that glide (the precise mechanism is unknown) along 
their long axis. They exhibit a complex developmental cycle with individual and 
social phases. As long as there is sufficient food supply, vegetative cells prey, grow 
and divide as individuals or in small swarms. Under starvation conditions, bacte- 
ria start to act cooperatively, aggregate and finally build a multicellular structure, 
the fruiting body. Fruiting body formation is often preceded by a periodic pattern 
originally classified as oscillatory waves [10] and later named rippling [11, 12]. 

An experimental illustration of the rippling phenomenon is shown in Fig. la. 
Bacteria organize into equally spaced ridges (dark regions) that are separated by 
regions with low cell density (light regions). We examine the temporal dynamics of 
the density profile along a one-dimensional cut indicated by the white line in Fig. 
la. The resulting space-time plot reveals a periodically oscillating standing wave 
pattern (Fig. lb). Several experimental studies [10-13] report periodic rippling 
patterns with wavelengths between 45 and 100 pm, wave velocities between 2 and 
11 pm min~^ and temporal periods between 8 and 20 min. Typical cell lengths 
vary between 5 and 10 pm, so that the rippling wavelength corresponds to 10 - 
20 cell lengths. In addition, single cells have been found to move unidirectionally 
with the ripple waves in a typical back- and- forth manner [12]; they reverse their 
direction of motion with a mean reversal frequency of about 0.1 reversals min~^. 

Intercellular communication is essential in order to maintain a complex life 
cycle as exhibited by Myxobacteria. There are at least five extracellular compo- 
nents (known as A-, B-, C-, D- and E-signal) that enable cells to coordinate their 
behavior. The latest acting signal during development is C-signal, presumably 
identical to a cell membrane-bound protein called C-factor [14]. C-factor is pro- 
posed to play the key role for the formation of ripples. Addition of C-factor (which 
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(a) 




time 



Figure 1. (a) Snapshot from a rippling sequence in Myxobac- 
teria taken from a time-lapse movie (H. Reichenbach). Ridges of 
cells (dark regions) are separated by regions with lower density 
(white). White bar: 300 fim. (b) Space-time plot of the density 
profile along the white line in (a). Wavelength is 105 fim, temporal 
period is 10 min. 



can be extracted from rippling cells) increases the mean reversal frequency of cells 
[12]. C-signaling occurs when cells are in end-to-end contact [12]. The C-factor 
protein is encoded by the csgA gene [15]; csgA mutants {csgA~), i.e. mutants 
that carry a mutation in the csgA gene, are unable to ripple and aggregate [16]. 
Experiments with dilutions of C-signal-competent cells with csgA~ cells exhibit 
an increased ripple wavelength [12]. Sager and Kaiser put forward the following 
hypothesis: When two opposite moving cells collide head-on, they reverse their 
gliding direction due to exchange of C-factor [12]. In order to test this hypothesis 
we have designed a mathematical model. 



2. Model 

The discrete model for the formation of ripple patterns is based on the dynamics of 
individual cells. It is defined on a regular cubic lattice assuming discrete space and 
time coordinates, analogous to cellular automaton models (Fig. 2a). The spatial 
lattice constants are chosen in a way that bacterial cells (assumed as equally sized) 
cover exactly one node. Allowed cell positions are (i) directly on the substrate {x- 
y-plane) or (ii) on top of other cells. This reflects the experimental situation in 
which cells glide on a suitable surface and are organized in heaps [11]. The total 
number of cells is constant (absence of replication and death). 
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Figure 2. (a) Exemplary configuration of the model, the cell ori- 
entation is indicated by arrows, (b) The interaction neighborhood 
is a five nodes cross in the y — 2 -plane at the x-position the cell 
is directed to (here the cell orientation is the -f-x-direction). (c) 
Illustration of the migration rules. Cells move one after the other 
(the labeled (*) cell is the next in the row) occasionally lifting 
other cells (1), falling down (2) or slipping away (3). 



The basic rules of the model are derived from the experimental results and 
the hypothesis described in the previous section. We extend this hypothesis by 
assuming cells to be temporarily refractory after reversal. As we will show, this 
refractory phase is the most important ingredient for rippling. 

(i) Movement of cells is confined to the ±x-direction, a variable o G { — 1,1} 
describing the orientation of movement is associated with every cell. Once per time 
step all cells move sequentially (fixed order) to the neighboring node in according 
to their orientation. Exception scenarios are illustrated in Fig. 2c. All cells are 
assumed to glide with equal speed (fluctuations of this quantity are due to a small 
stop probalility p) of one cell length per minute. Thus a time step corresponds to 
roughly one real-time minute. 

(ii) After the migration of all cells synchronous interaction takes place. In 
the model, head-on-collision is defined as the existence of at least one cell B with 
opposite orientation in the (orientation-dependent) neighborhood of cell A (Fig. 
2b). We assume cells to be either sensitive or refractory. Refractory cells do not 
respond to C-factor, i.e. only sensitive cells reverse if invoked in collisions. After a 
cell has reversed it is temporarily refractory. 

At the end of the next section we will discuss the effects of the following model 
extension: A second cell type that represents the csgA mutants is introduced. Be- 
cause these mutants fail to encode the C-factor protein they do not induce reversal 
of other cells in collisions. The mutant cell itself can receive C-signals and reverses 
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Figure 3. (a) Simulation snapshot in a system of size 100x50 
containing 15000 cells with a refractory phase r = 1 time step 
after ca 5000 time steps (black corresponds to high cells columns). 

(b),(c) Same as (a) with r = 3 resp. r = 5. In (d),(e),(f) we show 
the corresponding space- time plots along 100 substrate sites in 
x-direction over 100 time steps. 

after collisions with non- mutant cells. Reversal of mutants also results in tempo- 
rary refractoriness; here we assume equal duration of this pligise for mutants and 
non-mutants. There are no transitions between the mutant and the non-mutant 
cell type. 

3. Simulation Results and Discussion 

The duration of the refractory phase r turns out to be the most important quan- 
tity for ripple formation. Regular patterns for refractory times r > r,. 4 time 

steps can be observed. Two counter-propagating travelling waves of about equal 
amplitude form a standing wave, in agreement with the experimental observations. 
For refractory times r < r,. we still observe pieces of waves without long-range cor- 
relations in space or time; for vanishing refractory time cells exhibit fluctuations 
near a homogeneous density state (Fig. 3a-f). Wavelength and wave period of the 
simulated ripples increase with r [17]. The experimental values of wavelength and 
temporal period of the rippling pattern (Fig. 1) are reproduced with a refractory 
time of 4-5 min. 

Our results show that cell reversal after head-to-head collisions between cells 
in an appropriate mechanism for ripple formation and maintenance only if it is 
supplemented by a refractory phase. So far there is no direct experimental proof 
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Figure 4. (a) Simulation snapshot in a diluted system (60% 
wild- type cells, 40% csgA mutants {csgA~)), r = 5. (b),(c) Same 
as (a) with 60% resp. 80% csgA~ . (d),(e),(f) Corresponding space- 
time plots. 



for the existence of a refractory phase. However, experiments with bacterial cells 
exposed to isolated C-factor revealed an increa^sed absolute reversal rate of roughly 
0.3 reversals per cell and minute [12]. This can be interpreted as an upper bound 
for the reversal rate resulting from a refractory phase of ca 3-4 min. 

Little is known from experiments about the height of cell heaps during rip- 
pling. In the latest experiments approx. 10^’ cells formed a circular spot of 6.5mm^ 
[13]. Assuming an average cell area of 16^m^ (length 8/im, width 2/im) this cor- 
responds to 2.5 cell layers, neglecting that the cell density is higher within the 
edges [13]. The results presented here are obtained for an average number of n = 3 
cells per substrate site. However, the results depend only weakly on the number of 
cells in the aggregate; variations of n between 2 and 10 do not produce significant 
changes. 

One advantage of our cell-based model is the possibility to mark and track 
single cells. In the simulations cells are found to move about a distance of half a 
wavelength before reversing [17], perfectly reflecting the back- and- forth movement 
of cells in the experiment. In fact, travelling waves reflect each other leaving the 
crest shape unchanged. While most of the cells ride with the crest and are reflected 
when the crest collides with a crest moving in opposite direction, occasionally a 
cell does not And an interaction partner and continues travelling in the original 
direction. 

Patterns produced in mutant-diluted systems are shown in Fig. 4. For a fairly 
large fraction (40%) of C-signal-defective mutants (Fig. 4a,d) the pattern is still 
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recognizable and regular with long correlation lengths in space and time. With 
further increasing frequency of mutants the pattern becomes irregular, nevertheless 
with short-living wave crests. In the simulations the wavelength of the ripples does 
not significantly change with altering mutant fraction, whereas experiments report 
a strong increase of the wavelength for mutant fractions larger than 50% [12]. 

We presented a discrete model which is designed to find out what is essential 
for rippling. It is simple but also simplified. On the basis of the refractory period 
one can now develop a more realistic model (which reproduces the experimental 
results with mutants, for instance). Other models for the formation of rippling 
patterns have been proposed. A continuous model [18] with similarities to our 
model contains additional assumptions - like an internal biochemical clock which 
may play a role but is not essential for rippling as our results show. Another 
continuous approach [19] does not assume refractoriness and predicts characteristic 
density dependent reversal rates. Both models disregard the discreteness of the 
interacting cells which will be important at small cell densities. 

Our study strongly suggests experiments with single cells to verify the refrac- 
tory hypothesis and to elucidate its biochemical basis. Moreover, myxobacterial 
rippling provides the first example of pattern formation mediated by migration 
and direct cell-cell interaction, that may also be involved in myxobacterial fruiting 
body formation as well as selforganization processes in other multicellular systems. 
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What Might Be Expected from a Systems Biology? 

There are many problems in biology and medicine which are connected with the 
organisation of tissues. The change in form and shape of embryonic tissues is surely 
one of the most fascinating wonders of this world. By “organisation” it is generally 
understood that there is a hierachy of biological or biochemical objects constituting 
the next level in a series of scales. Can these scales be “naturally” chosen? With 
respect to the lowest microscopic scales the answer is not yet clear. But without 
any doubt the cell is the fundamental unit of biological tissues, especially and most 
importantly from an evolutionary point of view. Theoretical studies have to take 
this level of organisation into account, separating the biological world into spheres 
below and above the organisation of cells. Hence the title of this chapter. 



What Can Be Found in this Chapter 

Most problems of developmental biology have to do with cell differentiation trig- 
gered by signaling molecules [7,14]. Early experiments already suggested that the 
formation of a body axis is controlled by organisers^ a small assembly of cells that 
is able to influence the fate of cells in its neighbourhood. This led to the idea of 
pattern formation by an activator-inhibitor mechanism, first developed by [12]. 
In the meantime much experimental knowledge about the involved molecules has 
been obtained. An overview over recent findings for the model organism Hydra 
can be found in the contribution by Technau et al. in this chapter. Hydra is an 
example in which the development of theoretical models had strong influence on 
experimental design, starting with the work by Gierer and Meinhardt [3]. Gierer 
and Meinhardt suggested a generic pattern forming mechanism for head and food 
formation based on a diffusion-driven Turing-type instabilty heis been formulated 
with the help of reaction-diffusion equations. By coupling several pattern-forming 
reactions more complex patterns (for example inducing tentacle formation) can 
be obtained as is demonstrated in the contribution by Meinhardt in this chapter. 
Besides the importance of basic principles and the functioning respectively robust- 
ness of developmental organisation it has to be realised that all body structures 
have been shaped by evolution. Comparative studies like the one presented by 
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Olsson will help to understand the hierarchy and re-organisation of developmental 
programs on the evolutionary time scale. There is a mutual benefit with theoreti- 
cal modelling to be expected, first because development is surely not a matter of 
pure optimisation, but also one of redundancy and re-use. Secondly phylogeny will 
enable to recycle basic modelling principles for different model organisms, perhaps 
with a different degree of complexity. 
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Figure 1. Developmental systems based on cellular interaction 
need to integrate over micro-scales and determine properties on 
macro-scales (phenotype). The cellular scale is marked with an 
arrow. 



Besides reaction-diffusion models which average over cellular interaction, 
many developmental problems are related to cell fate and local division of labour 
among different cell types. Such problems can best be studied by models that are 
located at the cell level, see Fig. 1. A prominent example is the model organ- 
ism Caenorhabditis elegans (just honoured by a Nobel prize for S. Brenner, J. E. 
Sulston and E. R. Horvitz) which consists of 959 cells, with 131 eliminated by 
apoptosis during development (for an overview to apoptosis and its role in devel- 
opment see [6]). Upscaling techniques have to be derived to find bridges between 
different scales. An important contribution can be expected by cell-based mod- 
els which is inspired by both modern mathematical tools and new experimental 
designs, see the contribution by Kirkilionis in this chapter. 
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Challenges and Benefits: Some Questions to be Answered 

The first major advantage of all theoretical modelling is that different aspects of 
a scientific problem can be brought into a coherent framework. It is a basic step 
before quantitative aspects of an experiment can be compared with hard-wired 
concepts. But it should be noted that the question “Can developmental systems 
be abstracted mathematically?” still has to be answered for various aspects of 
development, and the kind of modelling being adopted is crucial for the further 
development. If successful models can be obtained at least for subsystems, the 
benefit is an implicit knowledge of the degree of complexity minimally needed to 
simulate and possibly “engineer” living tissues. In the following a list of decisive 
questions is presented, concluding this introduction. Each of these questions is 
briefly discussed. Their (partial) answers should strengthen the interplay between 
theory and experiment. 

How can the test of models of developmental biology be improved? 

There is a need for more decisive experiments to better distinguish between model 
variants. Models should be tested by the predictions they make, and predictions 
must be coupled to something measurable in the laboratory. Of course concep- 
tual modelling has still its place, but the time seems to be mature to enhance the 
confrontation of models with laboratory experiments. Like in other areas, models 
should be able to predict various experiments simultaneously. For example, the 
different outcomes from grafting experiments performed with Hydra can still not 
all be predicted by a single model. 

What are absolute sizes and what relative proportions in an organism? 
Typically, in development the right signal is needed at the right time, with a 
subsequent stabilisation of the decisions being made (for early Drosophila see [5]). 
In many organisms the size of the body or body organs can vary enormously 
with environmental conditions, still relative proportions have to be maintained. 
Nevertheless, some parts of the tissue seem to be not scale invariant, as the size 
of the organisers in Hydra (see Technau et aL). 

What makes developmental systems so robust? 

As already mentioned, there is a remarkable robustness of developmental programs 
against perturbations, for example against variations of initial conditions and/or 
parameter changes. Is there a mathematical structure in all models that can exhibit 
such robust behaviour? (For example feedback loops, see [2]). Where exactly can 
robustness be found in experiments? (See for example [1]). 

How much do we know about the genotype-phenotype relationship? 

This is an equally large problem in development £is it is in population genetics. 
Surely the genes do not determine all developmental programs, also the body 
structure and environmental conditions present at each point in time are creating 
boundary conditions which cannot be neglected (A critique very much emphasized 
by B. Goodwin, see for example [4]). Although the mapping between genotype and 
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phenotype is very complex, “genes must be put to their place” [8] . Some attempts 
have been made which assume patterns as phenotype and simple networks as ge- 
netic switches, see Salazar- Ciudad et al. [9-11]. 

Evolutionary considerations: Why is the organism organised like it is? 

With a further understanding of the genotype-phenotype relationship there should 
be more insight into how very similar (evolutionary preserved) genetic networks 
can create a variation of phenotypes by taking over new functions; see [13] for an 
overview of ideas which should be partially accessible for modelling. 
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1. The Hydra Head Organizer 

One of the first events to occur during early embryonic development is the defini- 
tion of the body axes, which provide positional cues to the cells for the differentia- 
tion into region-specific tissues. For this to happen “organizer” tissues are specified 
in defined areas of the embryo that are able to instruct and recruit the surrounding 
tissue to contribute to the formation of a particular structure. The best studied 
example of such an organizer tissue is the dorsal lip of the frog embryo, which 
is able to induce a secondary body axis when grafted to the ventral side of the 
embryo [38]. Later, it has been demonstrated that the node in chick and mice has 
the same property (for review see [16]). The discovery that the same set of genes 
are active in the organizer of all vertebrates further suggested that basic features 
of the organizer might be conserved. However, it was unclear when and how the 
organizer and its molecular composition did arise during animal evolution [16,21]. 

Interestingly, organizers are not restricted to vertebrates. In the freshwater 
polyp Hydra, Ethel Browne (1909) showed in elegant experiments that a small 
piece of tissue from the hypostome, the most apical tip of the animal, is able to 
induce a secondary body axis, when grafted laterally to another polyp [5]. Hence, 
the hypostome is the equivalent of the Spemann organizer in amphibians (see 
also [4]). Hydra is a member of the diploblastic Cnidarians, which belong to the 
most simple living metazoans. Cnidarians evolved about 700 million years ago 
and represent the first animals with defined body axis and a nervous system. The 
fact that Hydra's head tissue exhibits a strong axis inducing capacity suggests 
that organizers are a basic feature of all metazoans which evolved already very 
early during metazoan evolution. Since Hydra's body plan can be also completely 
reestablished from dissociated and reaggregated single cells, organizers can be set 
up de novo, i.e. by self-organisation in a tissue without inherent asymmetry or 
external cues. 
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2. Molecules of the Head Organizer 

Several transcription factors, e.g. the HNF3f3 homolog hudhead [30] the homeobox 
gene goosecoid [4], and the T-box gene Brachyury [40] that play a role in the 
organizer of vertebrates have been isolated from Hydra and all of them are also 
expressed in the organizer region in Hydra (for review see Galliot, 2000). Yet, until 
recently, no signalling molecules have been identified from Hydra that could act as 
diffusible morphogens. In the vertebrate organizer, two major signalling systems 
are responsible for the function of the organizer: one is the Wnt-Wg signalling 
cascade and the other is the TGF-/3 / BMP antagonist Chordin [7,8]. 

We cloned a Wnt ligand (HyWnt) and the cytoplasmic mediators Dishevelled 
{HyDsh), GSK3 (HyGSK3), and /?-Catenin (Hy/3-Cat). In a two- hybrid screening 
with a Hydra /9-Catenin bait, we also identified the transcriptional co- activator 
Tcf (HyTcf) [18]. A Hydra member of the family of Frizzled receptors was recently 
published [33]. In situ hybridization revealed that Wnt signalling acts in axial 
patterning in Hydra. HyWnt is expressed in a small number of about 50 epithelial 
cells in the apical tip of the hypostome which represents the Hydra head organizer 
(Fig. 1). expression is also restricted to the hypostome of the polyp, but the 

HyTcf expression domain is broader than the HyWnt spot comprising the entire 
hypostome, and thereby possibly demarcates the range of action of the HyWnt 
ligand ([18]). In the budding zone, where the new body axis of the daughter polyp 
is initiated ([34]), activation of the HyWnt pathway starts with an upregulation of 
Hy/3-Cat and HyTcf, and is followed by HyWnt expression in a spot of 10-15 cells 
(Fig. 1). Similarly, during head regeneration after decapitation, HyWnt, HyTcf 
and Hy/3 — cat are among the earliest genes to be upregulated within 30 min after 
wound healing [18]. Taken together, expression analysis during regeneration and 
budding indicate a pivotal role for the members of the Wnt-pathway in setting up 
the Hydra head organizer. 

The second major signalling system involved in early embryonic axis for- 
mation of vertebrates, i.e. the TGF/3/Bmp signalling pathway and its antagonist 
Chordin, is also present in cnidarians [19,24,35]. In Hydra, a Bmp ligand (B. 
Reinhardt and H. Bode, personal communication), a highly conserved receptor- 
regulated Smadl homologue [19], and the Bmp antagonist Chordin (our unpub- 
lished results) were found. In situ hybridizations of Hy Smadl and Chordin mRNA 
are consistent with the hypothesis that Bmp signalling is suppressed by Chordin 
in Hydra tissues exhibiting a high morphogenetic activity, which would explain a 
similar function in higher metazoans [8,37]. 

These data demonstrate that a core Wnt signalling pathway as well as the 
TGF/3/Bmp signalling pathway and its antagonist Chordin are present in Hydra. 
Wnt and TGF(3 signalling clearly existed in the common ancestor of diploblastic 
cnidarians and the triploblastic Bilateria and hence most likely were a basic feature 
of early multicellular animals. 




Molecular and Cellular Analysis of De Novo Pattern Formation in Hydra 311 







Ficjure 1. HyWnt exhibits a head- and bud-specific expression 
patterns in whole mount in situ Hybridisations. HyWnt is ex- 
pressed in a group of 15-50 cells defining the head organizer region 
at the tips of hypostome and a evaginating buds. 

3. A Reaction-Diffusion Model of the Hydra Head Organizer 

On the theoretical level, pattern formation and positional signalling in Hydra [2,45] 
was explained in terms of a reaction-diffusion model by Gierer and Meinhardt 
[12,43]. According to this model, a short range activator and a long range inhibitor 
form a feed-back loop, which leads to a local activation of the tissue destined to 
form a head and a lateral inhibition of further activating centers in the vicinity. 
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It was postulated that both, activator and inhibitor are produced in the head 
and transmitted to the body column [12,27-29,31,32]. The “head activation gra- 
dient” (also termed source density gradient; [32]) maintains axial differentiation 
and can be measured as a gradient of head formation capacity in transplantation 
experiments [29]. 

The head inhibition gradient prevents head formation in the body column 
and can be also measured in transplantation experiments [28]. This model can 
explain organizer formation during reaggregation. After dissociation into a single 
cell suspension and subsequent reaggregation, all existing gradients of the polyp 
are destroyed and have to be reestablished [41]. Aggregates are therefore an ideal 
experimental model system to study self-organisation. The theoretical model as- 
sumes that random fluctuations in the activation level in the aggregate can lead 
to an ampliflcation of small local peaks resulting in the definition of a new head 
organizer. 



4. De Novo Formation of the Head Organizer from Single Cells 

One of the basic questions in the theory of self-organisation is, what is the minimal 
size of such random fluctuations in order to act as an organizer, or in other terms, 
what is the minimal size of an organizer. To address this question, we modified 
the classical approach of dissociation and reaggregation [13] by adding clusters of 
different numbers of labelled cells with an elevated level of head activation to an 
aggregate. The labeled cell clusters were produced from regenerating tips which 
have a maximum competence for head induction [42]. 

The tips were dissociated into single cells, aggregated in rotary culture, and 
the resulting cell clusters were fractionated by size [42]. When small labeled cell 
clusters consisting of 10-15 cells (60-120 /iin in diameter) were added to the carrier 
cell suspension, about half of them were found in a developing head (Fig. 2 A, B). 
Since the clusters contained only 0.2 - 4 % of the cells in an aggregate, the high 
frequency of labeled cell clusters in developing heads was not random indicating 
that they are involved in head induction. Indeed, the clusters remained confined to 
the hypostome while the tentacles were formed by the host tissue. This shows, that 
a cluster had instructed and recruited surrounding host tissue to the formation of 
a new head, which is the definition of an organizer sensu strictu. Single cells or 
very small clusters (30 (im in diameter) consisting of one or few epithelial cells 
have virtually no elevated capacity of induction, while clusters of about 90 fim. 
diameter containing about 50 epithelial cells show maximum inductive capacity 
(Fig. 2B). Thus, a minimal number of 10-15 cells is neccessary and sufficient to act 
as a head organizer. These data suggest that a “community effect” [15] between 
these cells is essential to create a stable signalling center. 

Besides the minimal organizer size, another crucial parameter in self-orga- 
nisation models is the activation range. Reaction-diffusion models predict that 
only cluster sizes that are similar or larger to the activator range will amplify and 
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form stable signalling centers. Clusters that are smaller than the activator range 
will not be amplified since they lose activator too rapidly to the environment by 
diffusion, and clusters larger than the activator range will not have any further 
advantage in inducing heads. Therefore a maximum of activation increase can be 
expected as soon as the cluster size is larger than the diffusion range of activation 
(see also [16,17]). We could deduce the range of the activator from our aggregation 
experiments [42]. Based on the experiment shown in Fig. 2 one can estimate that 
the range of activation is probably not much larger than the radius of a 90 /am 
cluster, which would be 45 jam or about 2-3 epithelial cell diameters. This range of 
activation is consistent with the estimated diffusion range of known morphogens, 
i.e. wnt/wingless in Drosophila [14]; (see below). 



5. Autocatalytic Short-Range Activation by the Wnt-Pathway? 

The fact that an activation-range of about 45 /am is neccessary and sufficient to 
stably amplify an organizer indicates that diffusible morphogens like Wnt play 
an instructive role in the activation process. We therefore examined the expres- 
sion pattern of HyWnt in early reaggregates. HyWnt is first expressed in small 
spots comprising only a few epithelial cells (Fig. 3) by 24h. At this time cells have 
completely sorted out into ectodermal and endodermal layers [41], indicating that 
HyWnt activation requires an intact epithelial tissue. By 96h, the HyWnt expres- 
sion domains have enlarged to their final size in future hypostomes (Fig. 3). The 
size of early HyWnt spots is 50-60 ^m, which corresponds to the minimal cluster- 
size that can act as an organizer (see Fig. 2B), while late HyWnt domains are 
similar in size (150-200 pm) to the large clusters which can induce a head. 

The reaction-diffusion model predicts an autocatalytic feed-back loop during 
the activation process. Preliminary data allow us to propose a feedback control on 
the HyWnt pathway. First, HyWnt might activate and stabilize its own expression 
directly via its transcriptional mediators Hy/3-Cat and HyTcf. During reaggrega- 
tion, Hy/3-cat and HyTcf are expressed uniformly throughout the aggregate and 
become later restricted to domains where new heads are being formed. Notably, the 
expression of HyWnt always preceded the apparent restriction of domains in the 
initially symmetrical environment of an aggregate, and all HyWnt domains finally 
form a head [42]. Of course, subtle differences in the expression level of HyTcf and 
Hy/3-Cat, not detectable by the in situ hybridisation method, might possibly define 
the HyWnt spots, which in turn enhance upregulation of HyTcf and Hy/3-Cat in 
the surrounding tissue. Alternatively, an ubiquitous, but high level of HyTcf and 
Hy/3-Cat might provide a competence to the cells to produce HyWnt The activa- 
tion of HyWnh however, might be a stochastic process, which is initiated in single 
cells, but only maintained if by chance neighbooring cells also express HyWnt. Both 
scenarios are consistent with the idea of an autocatalytic feedback loop and that 
HyWnt is a direct target gene of an active Hy/3-Cat/HyTcf complex. In Drosophila^ 
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Figure 2. Head induction by activated cell clusters. (A) A two- 
headed aggregate (96 h) with a head containing a green-labeled 
60 uin cluster of aggregated cells from dissocoated 12 hr regener- 
ating tip tissue. Bar is 200 ^m. (B) Efficiency of cell clusters to 
induce head formation. Head formation frequency of single cells 
(30 //in ) and different cell cluster sizes were scored 80 hr af- 
ter aggregation in carrier tissue derived from whole polyps (filled 
circles, hatched line) and polyps lacking the upper fifth (filled 
triangles, solid line) (p<0.001, n =28-53; means (s.e.m., 3 exper- 
iments)). Control clusters derived from the corresponding carrier 
tissue indicated by open circles and open triangles. (C) Effect of 
head inhibition in aggregates. Aggregates that contained compet- 
ing 120 fim and 60 //m cell clusters in a single aggregate. 120 /xm 
cell cluster inhibited the formation of a head from the 60 /xm cell 
cluster; the correlation of head formation frequency of 60 /xm cell 
clusters with their distance from the nearest head is shown. (A-C 
from Proceedings of the National Academy of Sciences USA 97: 
12127-12131; PNAS, 2000) 
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Figure 3. Expression dynamics of HyTcf, HyWnt, and HyBral 
during aggregate development. In situ hybridization reveals pat- 
terning events during head organizer formation. HyWnt and Hy- 
Bral appear simultaneously in small spots (24h) which enlarge 
during later stages (96h) , and precede formation of morphological 
head structures by about 2-3 days. All spots eventually develop 
into heads. (From Proceedings of the National Academy of Sci- 
ences USA 97: 12127-12131 and Nature 407: 186-189; ©Nature 
Publishing Group, 2000; ©PNAS, 2000). (For colored picture see 
color plate 11). 



autocatalytic self- activation of Wg (the Drosophila Wntl homologue) and a func- 
tional Tcf-binding site in the Wg promoter have been demonstrated [25,44]. 

We have additional evidence that HyWnt might be also coupled by a posi- 
tive feedback with another early head gene, HyBral, a Hydra homologue of the 
T-box gene Brachyury [40]. In aggregates, size and time of appearance of small 
Hy Bra 1-positwe spots are equivalent to the HyWnt expression dynamics. Inter- 
estingly, HyBral also shows synexpression with HyWnt during budding and head 
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Figure 4. Putative positive feedback in Hydra Wnt signaling. 
Preliminary evidence and comparison with higher metazoans sup- 
port the view that direct, autocatalytic self- activation and indirect 
feedback between HyWnt and the transcription factor HyBral are 
involved in establishment and maintenance of HyWnt signaling 
(for details see text). 



regeneration as well as in adult polyps, although the Hy Bra 1-positive domain in 
the steady state hypostome is broader than the HyWnt-positWe domain [40]. A 
putative Tcf-binding site has been recently identified in the HyBral promoter (un- 
published data) , which supports the idea that Brachyury and Wnt are members of 
a synexpression group in Hydra. In mouse embryos and mouse cell lines, Brachyury 
is a direct target gene of Wnt3a signalling [10,26], and Brachyury itself again acti- 
vates transcription of Wntll in Xenopus [39]. Direct experimental proof for such 
a feedback loop in Hydra would be of particular importance. 



6. Long-Range Inhibition from the Head Organizer 

The second major component in the reaction-diffusion mechanism is the inhibitor 
which is produced by the activation center and transmitted to the surrounding 
tissue to prevent the initiation of another activation center [12]. We estimated 
the range of inhibition in a competition experiment by introducing cell clusters of 
different size into an aggregate [42] where larger cell clusters (120 /am) exerted an 
inhibitory influence on the smaller clusters (60 /am). By measuring the distance of 
small clusters from the nearest head the range of inhibition could be estimated. 
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About 50% of the small clusters were not involved in head formation at a distance 
of 600 //m, while essentially all of them were in heads at 1000 ^m (Fig. 3C). This 
indicates an eflFective range of inhibition of about 800-900 fim. By comparison, the 
range of activation, as deduced from the size of cell clusters able to induce head 
formation, was about 45 /am, which is easily 20x shorter. These results are in good 
accord with previous calculations based on the reaction-diffusion model by Gierer 
and Meinhardt (15 fold) and on Mac Williams proportion-regulating version of the 
model (>20 fold) ( [28,29]). 

An important prediction of the Gierer and Meinhardt model is that a rise of 
inhibition is tightly linked to the rise of activation, which should result in an equal 
spacing of the activation centers in an aggregate, independent of the average head 
activation level in the aggregate. We found, however, that aggregates from apical 
tissue (with a high average head activation level) not only formed more heads than 
aggregates derived from basal tissue (with a low average head activation level); the 
spacing of heads formed in apical tissue was also highly irregular. Were the rise 
of lateral inhibition tightly linked to the rise of activation, the spacing between 
heads should be always quite regular. Thus, the kinetics of inhibition and activation 
increase are less tightly linked than previously postulated [12]. This feature allows 
pattern formation to occur in a broad range of initial conditions [42], and adds a 
further level of robustness to the system. 

The inhibition gradient ensures the size control over a given morphogenetic 
field. This size control seems to be a general feature in many embryos of higher 
metazoans that is not well understood. It is also far from clear how inhibition is 
mediated on the molecular level in Hydra. Based on preliminary evidence in higher 
metazoans, there are two alternative possibilities: 

(i) Inhibition could be exerted by molecules which are also involved in growth 
control, like the insulin pathway or nitric oxide [22]. Mutations in the insulin path- 
way have been shown to cause a size reduction of the imaginal discs in Drosophila, 
and ectopic NO synthetase (Nos) expression decreases the size of imaginal discs, 
while inhibition of Nos can increase the leg size in the DV and AP axes [22]. NO 
can rapidly diffuse between the cells, and it could be related to changes in the 
pattern of rapidly cycling cells during head regeneration and budding [20]. 

(ii) An alternative explanation for the inhibition gradient could be that its 
formation is correlated to a more complex interaction of known morphogens with 
antagonistic factors. The two systems in which size control and morphogen gradi- 
ents have been extensively studied are the Drosophila wing disc and the amphibian 
blastula animal cap [6,23]. Here, members of the transforming growth factor beta 
(TGF-/5)/Bmp proteins act as long-range morphogens (100-300 /am), for which 
concentration dependent effects have been confirmed [3,14]. Changes in production 
of Dpp, the Drosophila Bmp homolog, can substantially redesign the Drosophila 
wing indicating a long range action. Recent studies using GFP-Dpp constructs 
suggest, however, a more complex mode of gradient formation including endocy- 
totic trafficking and degradation [9]. The antagonistic factor to Dpp/BMP2-4 is 
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Sog/Chordin forming an opposing gradient. In Xenopus embryos the Chordin gra- 
dient can have a range of at least 450 //m when overexpressed, although its in 
vivo range, which is restricted by the met alloprot ease Xolloid, appears to be less 
far [1]. In Drosophila^ it has been recently directly shown that Sog forms a protein 
gradient in dorsal cells of the embryo [36]. On the dorsal side, Tolloid (Tld) degra- 
dation and a dynamin-dependent retrieval of Sog act as a dorsal sink for active 
Sog [36]. This long-range activity of Sog/Chordin and the related degradation by 
Tolloid/Xolloid could be an important component of the autocatalytic feed-back 
loops involved in the long-range inhibition phenomena and size control of Hydra. 



7. Conclusion and Outlook 

The Wnt and TGFbeta pathways, as well as members of the T-box gene fam- 
ily play a crucial role in the patterning of all higher animals. The data reviewed 
here indicate that Hydra^ a representative of one of the oldest metazoan phyla, 
uses these genes in a signalling center for regulating the establishment of its ma- 
jor body axis. This strongly indicates the antiquity of this patterning system and 
points towards an origin of signalling centers in the earliest multicellular animals. 
These genes have also an important function during de novo pattern formation in 
reaggregates. Aggregates are able to generate complete structures (whole organ- 
isms) starting from a broad range of initial conditions. We therefore propose that 
during early metazoan evolution an extremely robust and flexible self-organisation 
system involving these molecular interactions was selected for, which became con- 
served during the evolution of higher animals. It is an attractive hypothesis that 
the signalling molecules identifled here, and their putative feedback control, rep- 
resent a core network of molecular interactions constituting an organizer which 
might be conserved throughout metazoan evolution. 
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1. The Problem of Biological Pattern Formation 

The generation of the complex structure of a higher organism within each life cycle 
is one of the most fascinating aspects of biology. Life starts, as the rule, with a 
single cell, the fertilized egg. Although many eggs have pronounced asymmetries, 
the final pattern cannot already be present in the egg in a hidden form. How can 
one part of a developing embryo develop differently than the rest, although all 
cells contain, as the rule, the same genetic information? 

At an early stage, many embryos can be fragmented and each part forms a 
complete organism. The early sea urchin embryo is a classical example. In some 
vertebrates, this can occur spontaneously, giving rise to identical twins. This indi- 
cates that a communication exists between different parts, such that the removal 
of some parts is detected in the remaining part and the missing structures become 
replaced. How has this communication to be organized in order that pattern forma- 
tion can occur? What allows regeneration? In many branches of science, theoretical 
analysis plays an essential role to understand complex processes. Development of 
a higher organism may appear too complex to find a mathematical description. 
However, this process can be separated into a number of elementary steps that are 
indeed theoretically accessible. 

2. The Freshwater Polyp Hydra as Living Fossil to Study Axes 
Formation 

Simple organisms has been used to explore basic mechanisms of development. One 
such model organism is the freshwater polyp Hydra (Fig. lA)The animal has a 
length of about 1-4 mm (Fig. 1) contains about 100,000 cells but it has only about 
seven different classes of cell types. It is essentially a cylinder consisting of an outer 
ectodermal and an inner endodermal cell layer. Characteristic structures are the 
so-called head with the tentacles and the hypostome - the proper mouth opening 
of the gastric column for food uptake. Fragments of a hydra can regenerate the 
missing structures to form complete animals, a feature discovered already in 1744 
by Abraham Trembley [1]. Tissue around the mouth opening has properties of 
an organizer: transplantation of small pieces into the body column can induce a 
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new complete head with mouth opening and tentacles. Even after dissociation into 
individual cells and re- aggregation, living animals can regenerate [2]. This shows 
that patterns can be formed from more or less homogeneous initial conditions. 

How does the head-foot axis of a Hydra relate the to main body axes of 
higher organisms? According to the classical protostomia-deuterostomia concept, 
the opening of the gastrula became enlarged and deformed to a narrow slit. One 
end of this slit forms the anterior, the other end the most posterior pole of the 
animal. The slit is assumed to become the midline [4,5]. Comparison of gene 
expression in Hydra and at early stages of higher organisms, however, suggest a 
fundamental different assignment [6] that can be summarized as follows (Fig. 1): 

1. The hypostome with the gastric opening of the Hydra (the so-called 
“head”), corresponds to the most posterior point, the anus of higher or- 
ganisms ( fFn^-expression). 

2. The Hydra-foot corresponds to the most anterior part of higher organisms, 
the forebrain and heart. This is indicated by the Nkx2.x expression. 

3. The tentacle/hypostome border corresponds to the midbrain/hindbrain 
border (posterior boder of O^x-expression) . 

4. A narrow zone next to the gastric opening (region of Gsc/Brachyury ex- 
pression enfolded to form the trunk in higher organisms. 

Thus, Hydra can be regarded as a living fossil, telling us about evolutionary 
early axis formation before bilaterality and trunk formation was invented. In this 
view is straightforward that the Hox gene pattern, typical for the trunk of higher 
organisms, is absent in Hydra: in the common ancestor the trunk was not yet there. 
The system that were patterning once the body of the ancestor was later used to 
pattern essentially the fore- and midbrain of vertebrates. The trunk, the largest 
portion of higher animals was added during further evolution by an enfolding of a 
narrow zone between the tentacle zone and the gastric opening. A crucial step in 
the formation of bilateral-symmetric organisms is the formation of a midline. One 
can show that insects and vertebrates invented completely different mechanism to 
generate the midline [6]. 



3. Generation of a Primary Pattern by Autocatalysis and Lateral 
Inhibition 

Pattern formation is not a privilege of biology. High sand dunes are formed al- 
though the sand is permanently redistributed by the blowing wind. Sharply con- 
toured rivers are formed by erosion although the rain is more or less homogeneously 
distributed over the ground. The formation of lightning from diffuse clouds is an- 
other example. Since all biological processes are assumed to be accomplished by the 
interaction of molecules, a theory of biological pattern formation has to describe 
the change of substance concentrations in space and time as function of the concen- 
trations of the other substances involved. The possibility to generate patterns by 
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Figure 1. (A) the freshwater polyp hydra. (B) With the in- 
ner endoderinal layer and the outer ectodermal layer, the basic 
body plan is close to a gastrula. (C) A drawing from Haeckels 
paper [3], proposing that all higher organisms proceed through a 
similar-appearing gastrula-like stage. (D-F) The oral-aboral axis 
of hydra and the proposed correspondence to the anteroposterior 
axis in higher organisms. The foot of the hydra is proposed to 
be derived from the bottom of the cup-shaped ancient gastrula. 
This part of the ancestor gave rise to the most anterior part of 
vertebrates, the forebrain and heart {Nkx2.x- expression, left). In 
contrast, the opening of the cup gave rise to the mouth opening in 
hydra and to the anus of higher organisms ( Wnt-expresssion) . The 
region around the tentacles correspond to the midbrain-hindbrain 
border (border of Otx-Gsc expression, dark/light grey border, ar- 
rowhead). In vertebrates (F), Gsc (light grey) participates in head 
formation while Brachyury forms the notochord and the tail bud. 
In hydra both genes are expressed next to each other (arrow). 
Thus, this is the zone that gave rise to the trunk (for details, see 
[6], for actual expression patterns in hydra [13-17]). (For colored 
picture see color plate 11). 
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the interaction of two substances that spread with different rates has been discov- 
ered by Turing [7]. Corresponding reactions are generally called reaction-diffusion 
systems. However, most interactions of substances that diffuse with different rates 
would not generate a pattern. Together with Alfred Gierer, I have shown that the 
crucial condition is local self-enhancement and long range inhibition [8-10]. It is 
easy to see that the examples of inorganic pattern formation mentioned above are 
also based on this mechanism: a small deviation from a homogeneous distribution 
has a strong feedback such that the deviation grows further. This increase becomes 
eventually restricted by an antagonistic effect, resulting either from a depletion of 
a prerequisite in the surrounding or by an inhibition that spreads out from the 
self-activating centre. 



4. The Activator-Inhibitor Concept 



According to our theory, a simple molecular interaction with pattern-forming ca- 
pability would consist of an ‘‘activator” whose autocatalysis is slowed down by a 
long ranging “inhibitor”. The following coupled differential equations provide an 
example for an interaction between an activator a{x) and an inhibitor h{x) that 
can generate patterns. 



da pa^ ^ d^a 

dh 2 1 

— =pa + 



( 1 ) 

( 2 ) 



Equation (1) states that a concentration change of the activator per unit time 
{da/dt) is proportional to an autocatalytic production term (a*^), and that the 
autocatalysis is slowed down by the action of the inhibitor (1/h). As any other 
biological substance, the activator molecules become degraded. It is natural to 
assume that the number of disappearing activator molecules is proportional to the 
number of activator molecules present. This is expressed by the term The 

autocatalysis must be non-linear since it must overcome the disappearance by the 
linear decay. Non-linearity of the reaction will result if the active component is not 
the activator itself but a dimer formed by two activator molecules. 

The concentration of a in a given cell may also vary due to an exchange 
of molecules with neighboring cells. Diffusion is the simplest way to take this 
exchange into account. Although diffusion is a good approximation, the real process 
is more complicated and requires a chain of several molecules; molecules that are 
exposed on the cell surface, molecules that act as receptors on neighboring cells 
and molecules that transmit the signal from the cell surface to the nucleus. Other 
modes of redistribution of molecules are conceivable as well. In plants, for instance, 
active transport plays a major role. 

The last term in Eq. (1), <Ja, describes a small basic (activator- independent) 
activator production. This term insures that the concentration of the activator 
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Ficuire 2. Elementary pattern that can be generated with an 
activator-inhibitor mechanism. If the range of the activator is com- 
parable with the field size, graded concentration profiles are gen- 
erated (top). If the field is larger than the range of the inhibitor, 
mnltiple maxima emerge (center). If the self-enhancement satu- 
rates, stripe-like patterns can emerge. Shown are simulations in a 
two-dimensional field. The initial, an intermediate and the finally 
stable distribution is given. For initiation small random fluctua- 
tions in the factor r/io, Eq. (1), were assumed. 



never sinks to zero. This is important for the initiation of the autocatalytic reac- 
tion at low activator concentrations. The term p, the source density or competence, 
describes the general ability of the cells to perform the autocatalytic reaction. 
Slight asymmetries in the source density p (that is, in the competence to per- 
form this reaction) can have a strong influence on the orientation of the emergent 
pattern. A necessary condition for the formation of a stable pattern is that the 
inhibitor diffuses much faster than the activator and that it has a shorter half 
life, i.e., Da Dk and pa < l^h must be satisfied. In other ranges of parameters 
oscillations and travelling waves can occur. The latter modes are essential for the 
pigmentation patterns on the shells of molluscs [10,11]. 
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Starting from a given initial situation, the equations allow the computation of 
the concentration profiles as function of time (Fig. 2). The simulations demonstrate 
that the interaction according to Eqs. (1, 2) hats properties that are baisic for the 
explanation of biological pattern formation. A pattern emerges whenever the size 
of the field exceeds the range of the activator. In fields with a size comparable 
to the activator range, the high activator concentration is formed at one end of 
the field. One side of an embryonic field becomes different from the other side - 
a crucial step in development. The graded activator and/or inhibitor distribution 
can be used to activate different genetic information in different parts of the tissue. 
If the field is larger than the range of the inhibitor, periodic patch-like or stripe-like 
activations can occur. 

The model accounts for pattern regulation such as regeneration. After re- 
moval of the activated region, the remnant inhibitor decays until a new maximum 
is formed via autocatalysis. To obtain a correct description of Hydra regeneration, 
however, few additions have to be made. 



5. The Wave Length Problem - Stabilization of a Monotonic 
Gradient by a Feedback on the Competence 

Usually the size of a morphogenetic field increases during the growth of the em- 
bryo. In a usual reaction-diffusion system a graded concentration profile can be 
maintained only over a range of about a factor two. With increasing field size, a 
tendency exists to change from a monotonic into a symmetric and ultimately into 
a periodic distribution. This occurs either by insertion of new or by splitting of 
existing maxima. 

However, Hydra can grow to a much larger extent without that additional 
heads emerge. Some observations provide hints of how this wave length problem 
is solved by nature. In a Hydra fragment the regeneration of the “head” occurs 
always at the side pointing towards the original head. This suggests that a sys- 
tematic decline in the ability for head regeneration exists from head to foot. In a 
fragment, the side pointing towards the original head hats always an advantage in 
the competition and will win. It is the relative position that is decisive. 

In terms of our model, the range of dominance of the activated over the 
non-activated region can be increased by an order of magnitude if a feedback of 
the activator or inhibitor on the source density (p in Eqs. (1, 2) exists. Due to 
such a feedback, the source density becomes graded too (Fig. 3; the source den- 
sity is the shallow dark grey distribution). In a region of low source density, the 
initiation of secondary maxima is unlikely. In contrast, the rising source density 
at an existing maximum stabilizes this maximum. Thus, the polar distribution 
becomes stabilized. The graded source density provides long-lasting information 
about the polarity of the system. Although secondary maxima are successfully 
suppressed, regeneration is not impaired. A small fragment regenerates a pattern 
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according to the original polarity since the graded source density provides the ap- 
propriate asymmetry. Due to its long time constant, the source density remains 
essentially unchanged during regeneration. Regeneration can be fast since no sym- 
metry breaking and no communication by diffusion over the total field is required 
(Fig. 3, bottom). 

The source density must have a shallow distribution, presumably similar to 
that of the postulated inhibitor. It may appear unreasonable to postulate two 
substances with approximately the same distribution but with opposite functions, 
with an activating influence on the one hand and with an inhibitory influence on 
the other. Why do not both influences cancel each other? Both functions must 
have very different time constants and spreading properties. The inhibitor must 
have a high diffusion rate and a short time constant to enable a stable pattern 
formation. In contrast, the source density must be non-diffusible and must have 
a long time constant. It provides the asymmetry of the tissue during regeneration 
and remains almost unchanged during this process. This dual role may be clarified 
by an analogy. A president (or any other local hero) has usually a strong tendency 
to suppress others from becoming a president too - a long range inhibition. On the 
other hand, he promotes individuals in his surrounding to obtain different ranks in 
a hierarchy, to become ministers, etc. - in this way, the center of power promotes 
the competence. If the center of power would become void, due to this graded 
competence is usually clear who will win the subsequent competition. Moreover, 
the central figure has to inhibit only those that have a high rank in the hierarchy. 
This shortens the time required that one candidate finally wins the competition. 



6. How to Generate Structures Close to Each Other and How to do 
so at a Distance: Head, Foot and Tentacle Formation in Hydra 

The complexity of the patterns in higher organisms requires a hierarchical linkage 
of many pattern forming reactions. One or more patterns generate the precondi- 
tion for a subsequent pattern. Hydra is under control of two organizing regions 
located at opposite ends of the tissue, the head and the foot. This is a common 
feature of many morphogenetic fields. How can it be achieved that two organizing 
centers reliably appear at opposite positions of an extended field? For Hydra^ a 
simple cross-inhibition is not appropriate since in small (young) animals head and 
foot must appear very close together. If at such short distances a mutual inhibi- 
tion between the head and the foot system would be at work, this would lead to a 
suppression of the foot by the nearby head or vice versa. This problem disappears 
when the spacing between the head and foot system is achieved by an interaction 
via the source density. As mentioned above, the head activation appears at the 
position of the highest source density. If the foot system has the opposite behavior, 
i. e., it appears at the lowest source density, the foot is formed at the maximum 
distance from the head (Fig. 3, the foot activator appears at the left side). Never- 
theless, head and foot system can coexist at a close neighborhood in small animals 
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Figure 3. Generation of complex patterns by linkage of several 
patterns forming reaction: Simulation of hypostome, tentacle and 
foot formation in Hydra. Primary head activation (right-most sig- 
nal, light grey) and foot activation (left-most signal, darker grey) 
appear at opposite end of the field due a coupling via the source 
density (or competence; the shallow dark profile). Tentacle activa- 
tion (dark grey) appears close to the hypostome since it requires 
a high source density but it is locally suppressed by head activa- 
tion. The top distributions show the establishment and mainte- 
nance of the signals during growth, the distributions below show 
the behaviour during regeneration. Right: in regenerating near- 
head fragments tentacle activation precedes head activation. The 
new tentacle signal appears first at the tip (dark grey), the region 
of the highest source density. Later, with the rising head signal 
(light grey), it is shifted to the appropriate position, in agreement 
with the experimental observations [13]. Left: in more basal frag- 
ments head activation occurs first. Tentacle activation takes place 
later at the final position after the source density has obtained a 
threshold value (from [12, 10, 18]. (For colored picture see color 
plate 11). 
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since no direct inhibition is involved. The graded source density only generates a 
preference. Experimental evidence indicates that the foot also lowers the source 
density, contributing in this way to the maintenance of the source density gradient. 
In the simulation, for the foot signal, a similar activator-inhibitor interaction has 
been assumed, only that it enhanced by a low source density (1/p instead of p). 

Many structures emerge during development close to each other in a precise 
arrangement. We have shown that a controlled neighborhood of structures is en- 
forced if one structure activates the other on long range but excludes it locally. In 
Hydra^ the tentacles appear around the opening of the gastric column. Many ex- 
periments can be accounted for by the assumption that tentacles are under control 
of a separate activator-inhibitor system that also depends on the source density. 
Since the source density increases under the influence of the primary head sys- 
tem, the latter generates the precondition for tentacle initiation. Locally, however, 
the head signal suppresses tentacle formation. Thus, tentacle formation is possible 
only at a sub-hypostomal position (Fig. 3). The equation for the tentacle system 
is similar as the hypostome system except that the head activator exerts an ad- 
ditional inhibitory influence on the tentacle signal the tentacles have no feedback 
on the source density [12]. 

The model accounts for a strange- appearing observation. After head removal, 
tentacle activation first reappears at the very tip of the gastric column [12, 13]. It 
is only later that this activation becomes shifted to the position where the tenta- 
cles eventually appear (Fig. 3). Since the tentacles are formed close together, the 
tentacle inhibitor need not to diffuse very far. In terms of the model, the tenta- 
cle inhibitor can have a short half life. Thus, after removal of the head and the 
tentacles, the tentacle inhibitor fades away more rapidly than the head inhibitor. 
Therefore, the tentacle activator can reappear sooner than the head activator. 
Since no suppressing head activator is present, this happens at the highest pos- 
sible source density, at the front end of the remaining gastric column. After the 
trigger of the primary head activation at the same position, tentacle activation 
becomes shifted to the final location. The prediction that the sequence of events 
is the reverse in more basal fragments or in buds has found meanwhile direct 
experimental support [13]. 



7. Conclusion 

Our theory shows that relative simple mechanisms account for essential features 
of pattern formation during development as long as our condition of local self- 
enhancement and long range inhibition is satisfied. Self-regulating pattern can 
emerge from initially homogenous situations. Regeneration is described. More com- 
plex patterning can be achieved by the coupling of several such mechanisms. Sig- 
nals to generate several structures appear in a precise spatial arrangement. The 
patterns are self-regulating and defects can be repaired. This spatial patterning 
occurs although the genetic information is identical in all cells. 
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1. Introduction 

Research trying to connect developmental and evolutionary biology has experi- 
enced a renaissance in the last 20 years. Developmental biology is relevant to evo- 
lution because changes in developmental processes and, ultimately, in the mech- 
anisms that regulate development underlie the morphological changes in adult 
organisms that are such a prominent feature of evolution. The increasing interest 
in comparative approaches to animal development is largely due to the accelerat- 
ing use of molecular genetics in elucidating the mechanisms of development, new 
methods for mapping out the fate of cells, and the incorporation of the methods 
for reconstructing phylogenies which have been developed by systematists. In this 
chapter, I will give an overview of recent and current work on the development 
and evolution of the head in vertebrates, using our own work on lungfishes and 
amphibians as an example. After a general introduction to the research field, I 
describe the migration, pattern formation and fate of neural crest cells as well 
as some general issues in evolutionary developmental morphology. Cranial neural 
crest development in vertebrates is evolutionarily very conserved and is begin- 
ning to be relatively well characterised biologically. It has, however received little 
interest from modellers. I present preliminary ideas about how the behaviour of 
these cells could be modelled, drawing inspiration from some recent work on Dic- 
tyostelium development as well as from own previous work on pigment cells, which 
are also neural crest-derived, and their pattern formation during salamander em- 
bryogenesis. 



2. The Transition from Water to Land 

One of the most important events in the history of vertebrates was the transition 
from water to land; the transition from fishes to amphibians, which took place 
in the Late Devonian. Several organs underwent drastic changes, many of which 
affected locomotion. The transition from fins to limbs has received a lot of attention 
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recently, prompted by the findings that none of the early amphibians had five digits 
(the archetypical pentadyctyl limb believed to be the primitive state), but rather 
six to eight [3]. The skull also underwent changes, for example lengthening of the 
snout, enlargement of the eyes and a more dorsal and posterior position of the 
eyes. Some bones were diminished and finally lost, making the skull lighter, for 
example the bones covering the gill chamber. The dentition underwent drastic 
changes, and the skull lost its kinetism, i.e. the internal joints between parts of 
the skull. A neck evolved as the shoulder girdle became separated from the skull, 
and the hyomandibula (a bone which attaches the jaw apparatus to the cranium) 
evolved into the stapes, the first ear ossicle to evolve. Trying to find out the 
developmental changes that caused these major anatomical alterations remains a 
difficult challenge. 



3. Vertebrate Head Development 

The skull and cranial musculature have played an important role in many major 
adaptive transitions in vertebrate history, but fundamental cispects of the under- 
lying processes remain poorly known. Prominent among these are the mechanisms 
that regulate both interspecific differences in cranial form and the integrated evo- 
lution of complex musculoskeletal systems; an important theme in vertebrate head 
evolution. Contemporary models of cranial pattern formation portray the develop- 
ing head as a highly interactive system of coordinated differentiation, morphogen- 
esis and growth [14]; they may offer a mechanistic basis for the highly integrated 
evolutionary change typically observed. 



4. The Neural Crest and Head Development 

The neural crest is a unique feature of vertebrates which gives rise to a large 
and diverse array of adult tissues, including much of the skeleton in the verte- 
brate head [10, 16]. The neural crest plays a fundamental role in that it not only 
directly contributes cells to structures in the head, but also directs, or at least influ- 
ences, the patterning of head structures where the cells are not crest-derived. Many 
neural-crest derivatives were first reported as the results of embryological studies 
conducted during the late 19th and early 20th centuries. Yet, several important 
features have been described only recently, and the full extent of neural crest con- 
tribution to the vertebrate body remains to be elucidated. A direct contribution 
of neural crest to cranial musculature was first reported in the 1980s by Noden 
[20, 21] and later by Couly and co-workers [4], who described the crest deriva- 
tion of connective tissue components of several visceral arch muscles in quail-chick 
chimaeras. More recent work has revealed that this additional role of the neural 
crest is only one component of a much more comprehensive mechanism of cranial 
development and patterning, in which the original compartmentalisation of hind- 
brain segments (rhombomeres), the neural crest, and musculoskeletal derivatives 
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are maintained throughout crest migration, pattern formation and histogenesis 
[7, 15, 27, 28]. This results, for example, in the connective tissue components of 
a given muscle and its skeletal attachment site(s) being derived from the same 
migratory crest stream. It can also lead to cryptic segmental boundaries, which 
may lie within individual connective-tissue elements and thus need not correspond 
to discrete anatomical landmarks. In the chicken, for example, distal portions of 
the lower jaw are derived from first (mandibular) stream neural crest, whereas the 
most proximal portion is derived from second (hyoid) arch crest [15]. Much of the 
classical research assessing the embryonic origins of cranial tissues in vertebrates, 
including derivatives of the neural crest, was based on work done in amphibians 
[10, 11]. These early studies precisely and accurately defined the extensive con- 
tribution of neural crest to many cranial tissues in both frogs and salamanders 
(e.g., cranial cartilages; [29, 30], yet none reported a direct contribution of neural 
crest to cranial muscle connective tissue in any species. Classically, the dogma has 
been that cranial muscle is derived from mesoderm, and that the drastic effects 
on cranial muscle patterning of neural crest extirpation are due to indirect effects. 
However, in the Mexican axolotl [Amhy stoma mexicanum)^ cranial muscle pat- 
terning is severely affected by neural crest extirpation and transplantation, and 
in a manner that is not explained simply as an indirect consequence of altered 
skeletal patterning [12]. In chimaeric South African Clawed frogs, genus Xeno- 
pus, larvae which had received heterospecific grafts of cranial neural crest, labeled 
(crest-derived) cells were observed within several cranial muscles [26]. Thus, the 
possibility of a neural crest contribution to cranial musculature and associated con- 
nective tissue attachments in amphibians remains controversial. Frogs (Anura) are 
of special interest because of the specialized larval head, which includes a unique, 
complex feeding apparatus comprised of novel upper and highly modified lower 
jaw cartilages and muscle configurations found in no other vertebrates [2, 9, 13]. 



5. Emergence and Migration of Cranial Neural Crest Cells 

The early patterning and migration of cranial neural crest cells is highly stereo- 
typed in all vertebrates studied. The cells emerge from the dorsal part of the 
neural tube, and soon form two streams in front of the inner ear anlage, and one 
stream behind it. In both lungfishes and amphibians, this stream soon divides into 
two, and later forms the skeleton of the gill arches. Figure 1 shows this conserved 
pattern in a selection of species. A closer look at the migrating streams of cranial 
neural crest cells, as provided by scanning electron microscopy, shows some typical 
features. The migration proceeds in an anterior to posterior wave, such that the 
anterior-most mandibular stream is the first to appear, followed by the hyoid and 
branchial streams. Neural crest cells organize themselves into these streams by 
migrating away from the margins of each forming stream, as shown in Fig. 2 - an 
embryo of the Mexican axolotl. Here it can also be seen that the individual cells are 
relatively flat and migrate directly on top of each other, which is quite in contrast 
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Figure 1. Cladogram showing the conserved pattern of cranial 
neural crest streams at comparable stages in Australian lungfish, 

Mexican axolotl and a variety of anurans. The species depicted 
are: Neoceratodus forsteri, Amby stoma mexicanum, Xenopus lae- 
vis, Bombina orientalis, Bufo bufo, Rana arvalis, Physalaemus 
pustulosus, and Eleutherodactylus coqui. 

to, for example, the pigment cells (also of neural crest origin) which migrate in 
the trunk of the same embryo. These cells migrate singly and are surrounded by 
an extracellular matrix. The matrix is important for the migration. In the white 
mutant axolotl, the matrix (but not the neural crest cells) is deficient, and there- 
fore the cells are unable to migrate normally [5, 6, 17, 25]. At a later stage, the 
streams of migrating neural crest cells in the head region of the Mexican axolotl 
are well developed, and the more posterior streams have begun to catch up with 
the mandibular stream (Fig. 3). After this stage, the mesoderm cells in the somites 
also begin to migrate and mix with the neural crest cells. In order to follow the 
further cell fate, one has to use some kind of marker system. 

6. Fate Mapping 

We have used a fluorescent lipophilic dye, Dil, which will intercalate into the 
cell membrane and be transported with the cells into which it is injected (Fig. 
4), and all their progeny. Using this approach in frogs {Bombina orientalis) as 
well as axolotls, we have confirmed that most of the head skeleton is neural crest 
derived, and that the different streams give rise to different parts of the skull 
in a segmental fashion (Fig. 5, [24]). This appears to be an evolutionarily very 
conserved trait. Furthermore it was demonstrated, using a combination of Dil 
injections and ablation experiments (in which the cranial neural crest streams 
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F Kill RE 2. A: Scanning electron micrograph of an embryonic Am- 
bystonia mexicanum (stage 20) showing the emergence of cra- 
nial neural crest from lateral view. The mandibular neural crest 
(MNC) migrates ventrally. A hyoid neural crest (HNC) stream 
and a branchial neural crest (BNC) stream are seen further pos- 
teriorly. The direction of migration is indicated by arrows . Scale 
bar = 0.2 inni. NT = neural tube. B: Close-up of A showing (left 
arrows) the split between the hyoid (HNC) and branchial (BNC) 
neural crest streams. The right arrow indicates the separation of 
the mandibular neural crest (MNC) stream from the hyoid (HNC) 
stream. Scale bar = 0.1 mm. 

were operated away one by one) that the cranial muscle attachments points, made 
of connective tissue, where the muscles insert on the skeleton, are also of neural 
crest origin at least in the mandibular and hyoid arches [22, 23]. The muscle fibers 
themselves are not of neural crest, but of mesoderm origin. Interestingly, connective 
tissue components of individual muscles of either arch originate from the particular 
crest migratory stream that is associated with that arch, and this relationship is 
maintained regardless of the segmental identity or embryonic derivation of 
associated skeletal components. These developmental relations define a pattern 
of segmentation in the head of larval anurans that is similar to that previously 
described in the domestic chicken, the only vertebrate that has been thoroughly 
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Figure 3. A stage 25 Ambystoma mexicanum embryo. Ante- 
rior is to the right. Overlying epidermis has been removed. The 
mandibular neural crest (MNC) stream surrounds the optic vesi- 
cle (OV). The hyoid neural crest (HNC) stream migrating rostral 
to the otic vesicle (O) has migrated furthest ventrally of all the 
streams. The neural crest cells begin to disperse from within their 
respective stream into the adjacent mesoderm. Scale=0.2 mm. 



investigated in this respect. The fundamental role of the neural crest in patterning 
skeleton and musculature may represent a primitive feature of cranial development 
in vertebrates. Moreover, the corresponding developmental processes and cell fates 
appear to be conserved even when major evolutionary innovations such as the 
novel cartilages and muscles of anuran larvae result in major differences in cranial 
form. 



7. Evolutionary Developmental Morphology 

We combine the fate mapping approach with traditional morphological methods, 
such as histology, which enables us to document the morphological development 
of form. The array of morphological methods has been augmented in recent years 
by the development of new microscopy methods such as confocal laser scanning 
microscopy. Using this approach, we undertake a comparative study of the devel- 
opmental morphology of the head in lungfishes, salamanders and various types of 
frogs. This allows us to extend the diversity of investigated organisms and gives 
fast, reliable, and three-dimensional views of the developing internal anatomy of 
embryos and larvae. 
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Figure 4. Neurula stage amphibian. The left side depicts six re- 
gions of the cranial neural fold; regions 2-6 were injected with 
Dil. The right side depicts the approximate origins of neural 
crest cells that contribute to the three cranial migratory streams 
mandibular (M), hyoid (H), and branchial (B). Dorsal view; an- 
terior at top. 

Figure 6 shows an example from our study of the concerted morphogenesis 
of skeletal, muscle and nerve elements in the head on the Mexican axolotl. Mus- 
cle is stained green and nerves red. This image is a “flattened” version of a 3D 
stack of images. An understanding of the development and evolution of form in 
biology must be based upon an appreciation of the diversity of living organisms 
as well as the unity of developmental regulatory processes. The presently dom- 
inating approach in developmental biology is to concentrate all efforts on a few 
model organisms, typically the worm {Caenorhabditis elegans), the fly {Drosophila 
melanog aster), the fish {Danio rerio), the mouse {Mus musculus) and at best a few 
more species. These organisms have been selected for a variety of reasons (short 
generation time, canalized development, etc.) which have nothing to do with their 
suitability for comparative studies [1]. This model systems approach needs to be 
supplemented with a phylogenetically based comparative developmental biology. 
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Figure 5. Larval skull and cranial musculature of Bombina ori- 
entalise depicted in dorsal (left) and ventral views. Neural crest- 
derived cartilages are shaded according to the migratory stream 
from which they originate (redrawn from Olsson and Hanken, 
1996) (For colored picture and more detailed explanation see color 
plate 12). Cartilages: BB, basibranchial; BH, basihyal; CB, cera- 
tobranchials I-IV; CH, ceratohyal; CT, cornua trabecula (trabec- 
ular horn); IR, infrarostral; MC, Meckel’s; OC, otic capsule; PQ, 
palatoquadrate; SR, suprarostral; TP, trabecular plate. Muscles: 
lev, levator mandibulae group mlrna, levator mandibulae ante- 
rior; mlmaa, levator mandibulae anterior articularis; inlinas, lev- 
ator mandibulae anterior subexternus; rnlmp, levator mandibulae 
posterior (comprising two parts; superficialis and profundus); ang, 
angularis group rnha, hyoangularis; mqa, quadratoangularis; rnsa, 
suspensorioangularis; hyoideus group mih, interhyoideus; moh, 
orbitohyoideus; msh, suspensoriohyoideus; osh, orbito- and sus- 
pensoriohyoideus; others mia, intermandibularis anterior; mip, 
intermandibularis posterior; mml, mandibulolabialis. Anatomical 
nomenclature follows Cannatella [2]. 



where the questions asked determine which organisms to use. To this end, we need 
to integrate what are still very different empirical enterprises (molecular genetics, 
comparative morphology, phylogenetics, palaeontology) with theoretical models 
and simulations of both developmental and evolutionary changes in form. 
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Figure 6. A larva of the Mexican axolotl stained with antibodies 
specifically recognising muscle and cranial nerves. Following im- 
munostaining, the larva was bleached and made transparent, and 
subsequently imaged using a confocal microscope. This allows op- 
tical sectioning through the larval head. The optical sections were 
then put together and all parts of the larva are in focus at the 
same time. (For colored picture and more detailed explanation see 
color plate 13). 



8. Modelling the Emergence of Cells and Formation of Streams 

The later development of the head in vertebrates, when the different morpholog- 
ical features that characterise larval and/or adult animals start to appear is a 
very complex phenomenon, and it is not very fruitful, I think, to try to model 
it before we know more about the biological regulatory mechanisms. The emer- 
gence and early migration of cranial neural crest cells, on the other hand, seem 
amenable to a modelling approach. As described above, the cells move on top of 
each other, and the streams have a certain similarity to Dictyostelium discoideum 
slugs. Dictyostelium is a biologically much better understood system, and there is 
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a long history of modelling it as well. Recently, Maree [18, 19] published a model 
in which the slug behaviour is due to the collective behaviour of the amoebae in 
the slug. Analogously, the cranial neural crest streams move as a net effect of the 
movements of the cells constituting them. While the cues directing the movement 
are probabaly different (there is empirical evidence that ephrins and their recep- 
tors are important in neural crest patterning rather than cAMP, and signals from 
the surrounding tissues, not light or temperature, are important for directing the 
streams), it could still be very fruitful to try a similar approach to those used 
by these Dictyostelium workers. To model cell signalling using an excitable media 
approach, and the movement of cells relative to each other using differential cell 
adhesion certainly seems like a reasonable approach also in our system. We have 
previously used differential cell adhesion to explain the migration and pattern for- 
mation of the trunk neural crest cells which give rise to pigment patterns in axolotl 
embryos [8]. In this model we used cellular automata to model cell movement, an 
approach which is more intuitive to the biologist than the systems of partial dif- 
ferential equations commonly used by theoreticians. A model along the lines of 
our previous efforts would have to deal with movement in three dimensions, and a 
less clear-cut system. The pigment cells come in two categories (black and yellow) 
which can be assigned different adhesion values based on biological knowledge, 
whereas the cranial neural crest cells do not fall into obvious categories like that. 
Still, the systems are quite similar and the same fundamental process (differential 
cell adhesion) seems to play in important role in both cases. 



Maree’s Dictyostelium work also points to and illustrates a difficult problem. 
On the one hand, experimentalists like myself tend to want a model that incorpo- 
rates as many aspects of the biological system as possible, and refining the model 
to meet this demand tends to make the model so complex that we risk to lose 
the overview and understanding that the model was created to give in the first 
place. When the model becomes as complicated and hard to understand as the 
biological system, this approach has reached its final, self-defeating stage. On the 
other hand, if the model becomes as general as the theoretician might strive for 
it to be, it might not help us very much in our struggle to explain our particular 
biological system, regardless of how generally applicable, simple and beautiful it 
might be. The biologist is then often left with an empty feeling. Fortunately, the 
solution, to “seek simplicity, and distrust it” was pointed out a long time ago by 
A. N. Whitehead. A modelling approach will also force us experimentalists to un- 
dertake new investigations, in the case of cranial neural crest streams, trying to 
label cells (with Dil or GFP) and monitor their movements within (and possibly 
between) streams is maybe the most interesting and challenging task for the near 
future. Describing, and hopefully someday being able to explain, how the archi- 
tecture of the vertebrate head, with its many different tissue and cell types, comes 
into being during development and has been modified during evolution, remains a 
grand, albeit distant goal for my research. 
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1. Essential Steps in Modelling 

All modelling starts with an analysis of the structure problem: What resolution 
has to be put to the system elements and their interaction? For tissue modelling 
and corresponding developmental systems this question amounts to the problem 
to assign resolution both to the cell’s internal behaviour - and to the cellular 
interaction. The basic parts of the system need no identification, as we immediately 
make a natural choice, i.e., taking cells as the basic elements of the model. But 
truly immense problems do arise here: We know so much about each single cell, 
just to quote genetics and proteomics. It is here where the often cited ’’information 
flood” of modern biology is currently taking place. How much information can 
we incorporate? Clearly this depends on the scientific question we would like to 
answer. On the other end of the scale are the immense complex collective properties 
of the tissue, resulting from the cell’s internal dynamics and the cellular interaction. 
Typical scientific questions on this scale are of the following kind: At which regions 
of the developing organism do certain substructures (like the mesoderm, etc.) first 
appear by cell differentiation? Which cells from which locations of the early embryo 
form these substructures? Or: How fast does the tissue or parts of it grow? (This 
simple question is obviously one of the most important issues in cancer research). 
Or: What is the influence of geometry? (see [2] for an example). Or a typical 
physiological problem: What is the filtering capacity of a liver with respect to 
certain metabolites? And also: What are the mechanical properties of a tissue? 



1.1. Density- type Tissue Models and Upscaling 

If we take the last question it can easily be noticed that scientific research of 
body structures needs not necessarily be based on the cellular level. There are for 
example sophisticated implementations of finite element discretisations of partial 
differential equations (PDF) describing bone and tissue mechanical properties, as 
required for surgical purposes (see [33] ) . Another area where this remark certainly 
holds true are vasular systems (see [24]). With respect to these previous examples 
a second typical step in modelling has to be discussed: Finding bridges between 
scales by taking limits. It would not be appropriate or even possible in most cases 
to study numerically a problem of blood flow, say in the heart, on basis of a heart 
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Figure 1. Basic modelling decisions: Density-based (top, (a)) vs. 
particle- based (bottom, (b)) models. 



muscle consisting of single cells (here the physics is in the foreground, and not the 
genetics etc. of some cell building the tissue). 




lipid bilayer 
inside cel! 



fraction of tissue with 
single cells and 
extracellular matrix 



"homogeneous" tissue 

► 



increasing spatial scale 



Figure 2. Scaling relationships, here with respect to space. 

But often the density description can be understood as limits of many par- 
ticle systems, where the number of particles is going to infinity in an appropriate 
scaling (as an example we mention chemotactic cells, see [28]). Similarly a cer- 
tain topological feature might be repeated infinitely often, such that by upscaling 
the problem the effects of micro-structures result in an averaged behaviour (cell - 
gap junction - cell etc., this textbook example is discussed in [14]). The relevent 
mathematical theories are multi-scale and homogenisation analysis [1,30]. As the 
titles suggest most results are applied to material science, the upscaling of biolog- 
ical properties from cellular behaviour is a wide open field with few results! The 
problem of either considering particle-based vs. continuous models is also relevant 
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in mathematical developmental biology. Most results on pattern formation have 
been derived for reaction-diffusion equations (starting with [31], later [10,22,27], 
for some recent developments see [9]). In most cases the complexity of the biology 
of developmental organisms does currently not permit to give a rigorous derivation 
of these equations from particle- (here cell-) based systems, but as macroscopic 
“carricatures” with heuristic reaction terms these equations have played (and do 
still play) an important role to suggest and investigate the basic (pre-) patterning 
processes needed for self-assembly of the embryo. 

1.2. Cell-Based Models 

Nevertheless it is in many cases necessary to study the self-assembly properties of 
living tissue directly on the cellular level. Even for the price that only moderately 
many interacting cells (and no continuum) can be studied in this case. There are 
two main reasons for this: Small-scale variations of cell- type and the model relation 
with respect to experimental input First consider a typical situation, a small section 
of the colon, see Fig. 3, which is a sketch of a figure that can be found in [23]. 

Due to the harsh conditions in the colon, the colon cells must constantly 
re-assemble the tissue to keep it functional. Cell division and cell positioning must 
be coordinated over time, with new cells taking the positions of their dead pre- 
decessors. There are a number of specialised cells involved, like stem cells at the 
bottom of each crypt (bottom of Fig. 3), other already differentiated cells absorb 
nutrients etc. Therefore the small-scale differentiation pattern is essential for un- 




Figure 3. Small-scale cellular diversity. 

derstanding the genetic-, metabolic-, and intra-/inter- cellular signalling networks 
at meso-tissue scale (on the order of several thousands of cells), which most prob- 
ably is the correct resolution for this type of problem. To be more precise at this 
point, it might not be the relevant macroscopic scale at which certain properties 
like mechanical stiffness of organs and bodies can be derived (elasticity equations 
etc.), but it is the right scale to incorporate cell differentiation, small-scale cellular 
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growth by cell division, coordination of cell movement and cell positioning, and 
the relevent genetic and signalling circuits in- and outside the cell that coordinate 
these actions. There is a chance to relate such modelling to data basses, as minimal 
genetic and proteomic structures can be incorporated. The success of quantita- 
tive biology will more and more depend on a cooperation between bioinformatics 
and mathematical modelling. Besides sequential and discrete data arising from 
sequence analysis and micro- arrays, such models can also profit from the renais- 
sance of modern light microscopy techniques and fluorescence markers. It can be 
hoped that future achievements in upscaling will allow more precise derivations of 
macroscopic equations describing the ability of pattern formation and biophysical 
properties of whole organs and organisms. In the meanwhile, cell-based models 
can help to reformulate ideas and pictures like the example in [23]. Hopefully they 
translate such problems into quantitative models which can better be tested in an 
iterative way by experiments. 



1.2.1. What Are Cell-Based Models? 

There are different possibilities to represent cells, their division, growth and re- 
location, depending on some fundamental choices. The mathematical alternatives 
are (deterministic or stochastic) cellular automata on regular or irregular lattices, 
or lattice-free alternatives. This terminology refers to a discrete pre-defined loca- 
tion of the cells in the model, or a continuous spatial state space in the latter 
case. Similar alternatives have to be discussed for the temporal state space. As 
cellular automata are described in other chapters of this book, we emphasize the 
need to study lattice-free models as well. Because such models are (most likely 
for historical reasons) presently uncommon (but see [7,17] and references therein 
as counter-examples), a formal mathematical description for one of the simplest 
possible tissue models in which internal cellular networks are incorporated is given 
in section 3. In which situations can such models be used? From a numerical point 
of view they are close to adaptive grid methods, which implies they should be su- 
perior in cases where the model is selecting its own scales in a certain range. This 
very likely occurs when cell growth, small-scale relocation and cell division have 
to be modelled in detail, because cell position and length and therefore local cell 
densities can be appropriately mapped into the model. Also if some field of con- 
centrations needs to be incorporated, like the distribution of signaling molecules 
“ whose time and space scale are different from those of the cell, and therefore 
can be treated as a density distribution this approach has advantages. In other 
cases there might not be any need for such overhead (like knowing the cell’s posi- 
tion in continuous coordinates). But it should again be emphasized that currently 
there is a lack of tested standard models and therefore also a lack of respective 
approximation theories. 
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2. Modelling with Internal and External Structure 

As outlined in the introduction we can decompose the model description into split- 
ting the structure into the assignment of internal and external states which influ- 
ence the cell’s behaviour. Internal structure variables are the most likely candidates 
(for most developmental problems, excluding for example mechanical properties) 
with the following two categories: 

• Intensity of genetic translation (correlated to mRNA concentration). 

• Concentrations of members of the cellular protein network (cellular meta- 
bolism and signalling). 
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Figure 4. Basic cell-cell interactions. A: Diffusing ligands in the 
extracellular matrix bind to cell surface receptors. B: Direct con- 
tact between cell membrane proteins and surface receptors C: Cell 
communication via gap junctions. 

The cell is assumed to be a well-mixed reactor (compare with Fig 4), with 
a homogeneous distribution of all the interacting molecules inside the cell. This 
assumption is conveniently made, but is surely problematic in some cases. Here 
more model results on cellular transport (gated, trans-membrane and vesicular 
transport) are needed. As we are interested in the dynamical behaviour of cells, 
variations of state variables are described by ordinary differential equations (other 
choices are stochastic variants or dynamical systems with discrete state spaces, 
for example difference equations. Evolving Boolean or multi-state networks can be 
used to cover both linkage structure and dynamics of cellular networks) Especially 
genetic networks have been often described by discrete state spaces, for example 
Boolean networks (see [13]). It is often a matter of time-scale separation to reduce 
the complexity of the network topology, especially the links between the genetic 
and metabolic levels of organisation. 
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2.1. Complexity Reduction 

Much of the complexity of dynamical networks can usually be reduced by time- 
separation arguments, i.e., we only need to incorporate slow variables, whereas 
the fast ones reduce to algebraic side conditions. Much work in that respect has 
been done with chemical reactions [6, 16], and is very similar to the reduction 
techniques used in enzyme kinetics (Michaelis-Menten-kinetics etc.). But clearly 
the automatic reduction of a massive protein network with hundreds of reactions 
to a system feasible for cellular modelling (say with one to 10 dimensions) is 
still very much a modeller’s personal choice. More insight into this problem is 
expected from results obtained from the current interest in biological network 
topologies (both of genetic and proteomic networks, see for example [29]). There 
are also methods available that can dissect such network topologies, for example 
the so-called elementary flux-mode analysis, see [12]. They are very useful as only 
stiochiometric constants and the linkage structure of the involved species have to 
be known. But they cannot incorporate dynamical aspects as they assume the 
network is in equilibrium. Hopefully many of the pathways are redundant and not 
used during certain stages in development. As can be seen in Fig. 4, concentrations 
of membrane proteins (receptors) are also an obvious choice to be taken as state 
variables. Not only the concentration of signalling molecules in the extracellular 
matrix, but also the ability of the cell to sense a signal can be important. 

The external structure of each cell experience can be essentially treated in 
three categories: 

• State variables describing geometrical aspects of the surrounding tissue 
(transport properties of the extra-cellular matrix, extension of the tissue 
etc.) 

• Gradients of signalling molecules. 

• Non-autonomous environmental conditions, like temperature variations 
(can usually be excluded under laboratory conditions). 

Diffusing molecules are in most tissues at this scale the only long-range sig- 
nals (see again Fig. 4) usually called morphogens. In a standard textbook the 
following definition can be found: “A chemical whose concentration varies, and 
which is involved in pattern formation, is called a morphogen'\ see [32]. Mor- 
phogens are usually taken as markers of “positional information” (a term also 
coined by Wolpert). Other choices can be made. Cell state variables in contrast 
to external state variables can be taken, like receptor surface densities or the 
translation/blocking of genes, resulting possibly in cell differentiation (in case the 
change of these cell state variables has become irreversible). Besides morphogens, 
cellular interaction is restricted to local interaction between neighbouring cells. 

The next section 3 gives a more rigorous mathematical framework for the 
ideas presented in this chapter. It can be used as a framework to construct cell- 
based models of tissues. The ideas are analogues of physiologically structured pop- 
ulation models [3 -5,15,20], with the only difference that finitely many cells with 
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spatial positions are considered (instead of a density of cells, and no incorporation 
of spatial position). 

3. Lattice-Free Models Based on Single Cell Modelling 

At time t let there be N{t) cells present in the system. For this simple model we 
assume that cells are arranged in a one-dimensional array, modeling for example 
the growth of a developing body along its body axis. Well-fitting examples are the 
cylindrical body plan of Hydra (in the case of perfect rotational symmetry), or 
the elongation of plant roots. Cells cannot move out of this array and reallocate, 
making it a bad description for certain developmental stages where cells are freely 
moving to re-organise the body plan (Most relevant during gastrulation. The model 
is an extreme case in the sense that cell adhesiveness is maximal, and thus cell 
motility minimal. Both assumptions can be changed). Each cell has a label z, with 
i = 1, 2, ..., A^(^). The cell with label z has at least three individual state space 
components, 

• its position Xi{t) G M, 

• its length U{t) G R+, 

• and a physiological component, Ct{t) G ^ which includes mea- 

suring the position in the cell cycle. (So, if p = 1, €.,{1) could be taken as 
a cyclin concentration.) 

3.1. Cell Division 

As a further modeling step we require that the state of c*, (t) completely determines 
the times at which cell division occurs. We express this in the following way: 
There exists a functional F : ^ R such that if at time t' we have for some 

z' G {1, 2, ..., A/'(^')}, F{cif{t')) = 0, a cell division occurs for the cell with this 
label. One daughter cell inherits the label z', to the other one a new label z" is 
attached, thus increasing the number of present cells by one. 

The assumption of a cell division event defined implicitly by the equation 
F{cr{t')) = 0 makes the model completely deterministic if all cell state variables 
including Ci are described deterministically. Many cell division models assume sto- 
chastic cell division times, often because internal states determining cell division 
(such as cyclin concentrations) are not modeled explicitly. Nevertheless the vari- 
ables taken to construct c/ might not be sufficient for deterministic times t' of 
division. In this case we can introduce an evolving probability distribution for the 
chance of division at time t, with t' being the expectation. This complicates the 
algorithm as time steps At in the discretisation of the stochastic process have to 
be adapted such that in At the probability to have more than one division event 
is very small. See 25 for efficient implementations. 

3.2. Update Rules 

At division time t' , the position and length of the cells become discontinuous, but 
not the maturity c^{t) which keeps track of the cell history, i.e. the cell stores for 
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Figure 5. Illustration of the update rules for one of the simplest 
one-dimensional tissue formation models based on single cell dy- 
namics, see text for details. 



example the information of how many times it has already divided before. The 
update rules at time t' are self-explaining (see Fig. 5 for the geometrical interpre- 
tation): 



N{t'+) = 






with i” = N{t'+) = N{t'~) + 1, 

lAt'-) 

2 ’ 



— ) for all i G {1, 2, ..., N{t' — )} \ {z'}, 



h{t'+) = hit'-) for all z G {1, 2, ..., N{t'-)} \ {z'}. 

Here a “ ” indicates a value after, and a ” the value before the update 

event. There are different possible choices for modelling the (continuous) transi- 
tions of the cell’s individual states during time intervalls where no cell division 
occurs. Here we use a deterministic variant, as it is the simplest possible assump- 
tion, but other choices can obviously be made. Let 0 < • • • < 

m G N be an ordered sequence of time points inside the interval [to,tm] during 
which we observe the system. 



3.3. Interaction Variable 

All cells will in general depend on external conditions like environmental temper- 
ature. Besides temperature which most likely does not involve spatial gradients 
in the part of tissue we are considering, other external fields such as hormonal or 
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oxygen (or other dissolved gases) concentrations might well form spatial gradients. 
In any case we have to decide whether such external variables are given quantities, 
or depending on the spatial scale being chosen have to be closed by feedback 
loops incorporating cellular interaction, see section 3.5. Consider for example the 
concentration of ligands around the cell membrane, a quantity surely influenced by 
local cell interaction. By this fact such quantities (say there are p such quantities) 
are collected into a so-called interaction variable /(C ^), i-e. I : [to, tm] x ^ 
g G N, with f7(t) = being the spatial domain which the cell system oc- 

cupies at a certain time t. The quantity / is interpreted as a held with which each 
cell communicates with its “exterior” (here “exterior” means both extracellular 
matrix and other cells, which possibly have connections with the cell by gap junc- 
tions, see Fig. 4. A fundamental assumption will be that we only need to know the 
values of I at the cell centre x/, neglecting variations of the cell’s extension. The 
mathematical idea behind interaction variables is the fact that with a given I the 
model separates completely into single cell behaviour, allowing for the construction 
of iteration schemes [15] and numerical parallelisation strategies. 

3.4. Cell State Variable Changes 

Let tj , 1 < i < m—1 he the times at which a cell division takes place. The cells will 
possibly grow (changing their length /, , inducing also a change of their position, the 
centre of the cell, .tJ, and change their genetic, metabolic and signaling activities 
(described by the vector c,, the cell state). We assume they do this in a continuous 
way, depending on their present state (which is again described only by li,x, 
and c,.), and on conditions coming from outside the cell under consideration, 
modeled by the interaction variable I . Finally we have to provide a deterministic 
law describing each cell’s length, position and state in between division times, 
provided with the necessary boundary conditions at to and at the division times. 
First we consider the change in length: 

for alH e {1, 2, A^(«'+)} 
at 

and tj-\ < t < tj^j = 1, . . . , m, 

{ii) 1,{U)) = for all i G {1, 2, Af(fo)}, 

{Hi) I, {tj+) = ^or a\\ i G {1,2, ...,N{t'-)}\{i'}, (1) 

{iv) li>{tj+) = ^ 

(v) h"{tj+) = for all j = 1, . . . ,m - 1. 

Here 

X x R+. 

Eq. (1), (ii) is giving the initial conditions at time to, (iti), (iv) and (v) are 
update rules at cell division. Remember that at a division event the cell with index 
i' split into two daughter cells, one inheriting the index i', the other one creating 
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the new index i" = N{tj-) -\-l, j = 1, . . . , m - 1. Next consider the change in cell 
state: 

(*) = gc,iih{t),Ci{t), I{t, Xi{t))), for alH e {1, 2, . . . , N{t'+)} 

at 

and tj-i < t < tj^j = 1 , . . . , m, 

(a) Ci(fo) = Ci, 0 , for all i e {1,2, ...,A^(fo)}, (2) 

{Hi) Ci{tj+) = c^{tj-), for alH e {1,2, ...,7V(f'-)}, 

(iv) for all j = 1, . . . ,m - 1. 

Also 

Qc^i • ^ X IR^_|_ — > M_|_. 

The same interpretations apply as for Qc,i^ Note that assymetric cell division 
can be modeled by not distributing the vector q equally to daughter cells a,s stated 
in Eq. 2, (iv). The dependencies on li can account for necessary volume corrections 
of concentrations during cell growth. 

We do not have to consider the change of cell position x^, as this quantity is 
defined as the centre of the cell and can be computed by the knowledge of each cell’s 
length. This is an implication of the one-dimensional array structure. This choice 
would not be possible to make in more sophisticated models where cells are able to 
move freely. In this case we would have to introduce an additional transport model 
for the cell population. In case of cell adhesion and an incorporation of mechanical 
stresses due to cell growth and division, cell-cell neighbourhood relation can also 
change in multi-layer or multi-dimensional tissue models (see [17]). 



3.5. Cellular Interaction 

A possible choice would be a field describing the concentration of a molecular 
species (incorporated as a component into causing the cells to aggregate 

by chemotaxis. On basis of this example we still have to give a mathematical de- 
scription of those components of I{t, •), which are infiuenced by other cells’ states. 
This excludes in most cases temperature (which would be a given distribution, 
making the equations non-autonomous) , but includes the distribution of certain 
ligands released to the extra-cellular matrix, see Fig. 4, A. Assuming, a^ the most 
fundamental and most frequent example, that the ligand’s transport process in 
the extracellular matrix is governed by Brownian diffusion, an equation describing 
this component of /, say 7/(t, •)? is given by 



dli{t,x) 

dt 






N{t) 



= Di-^Ii{t,x) + x^ii,3:)[rl{h{t),Ci{t)J^,{t)) - bi{h{t),c,{t), 



for (t,x) G X ^{t) 
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d 

— Ii{t,x) = 0 for (t,x) € x dCl{t), 

Iiih,x) = Iim{x), with //,o(x) > 0 for X e f2(t), 



[boundary conditions] 
[initial condition] 



with 



Xi{t,x) 



1 : X e {x^{t) - !^,Xi{t) + 

0 : X e ^{t)\{xi{t) - 
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x) dx. 



The interpretation of these terms is obvious, \i is a characteristic function mod- 
elling the extension of a cell in the tissue, is the average concentration of ligands 
a cell experiences at its “surface”, D/ > 0 is the diffusion coefficient of the ligands, 
and 



r/ : X x ^ > R+ 

is a function modelling the release of ligands of type I from each cell per unit 
of time, which is assumed to depend on its state c, and the current signalling it 
receives (the cell’s environment), namely (which is a vector, i.e., can depend 
on other ligands than 7/ or on temperature etc.). Ligands are bound at the cell’s 
surface by a rate modelled by 



bi :R^ X X — > IR+ 

which depends on the same variables as r/. The dependence of ri and 5/ on Ij 
can account for volume correction of the growing cell. In another example where 
a component //,. describes the direct cell-cell communication via gap junctions, 
h- will be a piecewise constant function by the assumption of cells being well- 
mixed local reactors. It suffices to solve equations with discrete diffusion terms, 
i.e., equations of the form 

dh{t,x,) ^ k{x,_i - x,:)(7a.(^,x,_i) - Ik{t,Xi)) 
at 

+ Df.. k ( x , + i - X,)ih.{t,Xi+i) - h.(t.Xi)) 

+ hk{hit),Ci{t),I^,{t)), (3) 

h iU),Xi,) = h,A){xi) for all i€N{t), 

where lk(t,x) = Ik{t,Xi) for x G (x,;(f) - ^^,x,(f) + n being a spatial 

step width correction depending only on the distance between cells, and hk : 
X X is describing the internal reactions between cell state 

variables (i.e. enzyme concentrations etc.), the signaling molecules described by 
Ik, and possibly other components of 7. The update of initial conditions (last line in 
Eq. (3)) after a cell division is according to the inheritance principle as previously. 
With these ingredients the model is finally closed, i.e., all necessary components 
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for a simulation and or mathematical analysis of the cellular developmental system 
have been presented. 
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The understanding of complex diseases as, for example, cancer growth, as well as 
of the corresponding immune response, requires a detailed analysis of the interplay 
between different complex physiological regulation processes. These include intra- 
and intercellular regulation and their effects on tissue cells, as well as the humoral 
or cellular responses of the immune system, the endocrine and the nervous system. 
In order to obtain a better insight of this interplay, it is reasonable first to focus 
on situations which involve studying one of these components and in a next step 
consider situations where two or more of them are involved. 

Along this line of argument, this chapter presents six articles dealing with 
theoretical approaches intended to enhance understanding of these complex prob- 
lems. 

The first three articles concentrate on mathematical models of progressive 
stages of tumor growth, followed by an article on aspects of the immune system, 
an article on the fusion of enveloped influenza viruses with a host cell, and one on 
spatio-temporal rhythms of dynamic diseases. 

In the contribution by Drasdo, Dormann, Hohme and Deutsch, two mod- 
els for in vitro avascular tumor growth are discussed. The results suggest the 
presence of three basic growth regimes: an exponential growth, a power law like 
growth and saturation of growth. One of the models presented is an off-lattice 
model in three dimensions, the other is a lattice approach in two dimensions. The 
three-dimensional off-lattice approach operates in a regime where nutrient concen- 
trations are still high, whereas the two-dimensional lattice approach focuses on 
the regime where nutrient is already lacking. Both models “overlap” in the linear 
growth regime of the tumor. They show that this regime can be explained by both 
biomechanical and biochemical effects. 

The next stage of tumor growth, the early vascular growth just after angio- 
genesis, is discussed in Anderson’s contribution. Here, a hybrid model is introduced 
which considers tumor cells, the surrounding host tissue, matrix-degradative en- 
zymes, and oxygen. The tumor cells are modeled as individuals, the latter three 
entities as densities. Simulations show that cell-cell interaction is important at 
an early stage of the process, but during the subsequent loss of cell-cell adhesion 
molecules via addition of mutations, the later stages are dominated by cell-matrix 
interactions, which define the tumor geometry. 
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Coevolution of tumor cells and the immune system is discussed in the article 
by Walden. A short review of results related to tumor-cell properties, diversity of 
tumor-cell lines and the anti-tumor immune activity is presented. These show that 
experimental evidence suggests a need to interpret the interrelationship of tumors 
and the immune system as a coevolution process. 

Immune reactions can detect and eliminate many of the transformed cells 
and may prevent cancer right at the onset. As in the evolution of species, tumors 
may become extinct when their progression is not in balance with the resources of 
their environment, when the environmental conditions change to become less per- 
missive or when they are eliminated by selection. But in manifested tumors, cells 
have managed to escape the selection pressure of the immune system. The paper 
stresses that more effective cancer therapies might have to be based on further 
investigations conducted from the point of view of the coevolution of species. 

Behn, Brede and Richter discuss two different mathematical approaches for 
the understanding of immunological problems of different complexity. One is a bit- 
string model to study the architecture of the idiotypic network of B-lymphocytes 
and its random evolution. It is shown that an idiotypic network architecture ex- 
ists where immunological demands like the completeness of the repertoire and a 
persistent immunological memory can be fulfilled. Next, a non-autonomous non- 
linear system of differential equations is introduced which describes the dynamics 
of T-cell differentiation in dependence on antigen presentation, and its role in the 
desensitization of mast cells. The understanding of the differentiation process pro- 
moted by this model leads to suggestions like how hyposensitization therapy of 
allergy is best applied. 

In the contribution by Schreiber, Ludwig, Holzhiitter and Herrmann a lattice 
model is simulated which describes fusion between hemagglutinin expressing cells 
and appropriate target cells, thus representing the formation of pores during the 
fusion of the enveloped influenza virus with the endosomal membrane of the host 
cell. The simulation lattice represents the contact area between two membranes 
arranged in parallel and in a sufficiently close distance. Diffusion of viral glyco- 
protein hemagglutinin trimers, which mediate the pore formation, and receptors is 
modeled by random transitions between adjacent compartments of the simulation 
lattice. The stochastic dynamics of different types of molecule in the contact zone 
is modeled by a multivariate master equation for the (conditional) probability of 
finding certain molecules in a certain spatial compartment at a certain time. In 
this way the time evolution of the number of free and clustered membrane proteins 
as well as the probability of the occurrence of pores are calculated. The latter is 
directly compared to experimental results. 

Physiological rhythms appear at nearly all levels of morphological organiza- 
tion. An important feature of this rhythmic behavior is its ability to respond in 
a characteristic manner to different external requirements. Therapies should take 
this into account. One example is the waves of spreading depression. In the ar- 
ticle by Dahlem, Mair and Muller a review of experimental results on spreading 
depression and their interpretation as traveling wave fronts in excitable media is 
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presented and put into context with other spatio-temporal dynamical diseases. 
The connection between spreading-depression and the aura in classical migraine 
is shown. Especially, an ansatz to explain the fortification illusion is suggested. 
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1. Introduction 

Unveiling the secrets of tumor growth mechanisms is particularly important for the 
derivation of better therapies. Besides sophisticated molecular-biological methods 
the application of mathematical models to deal with selected aspects of tumor 
growth has become extremely popular in recent years. Mathematical models are 
particularly suited to analyze collective effects in tumor populations. Depending on 
the desired spatial resolution of the model one may either consider locally averaged 
quantities as, for example, the local cell density which leads to reaction-diffusion 
type equations [1] or formulate models in which each individual cell is represented 
explicitly. So far, individual-cell-based models have been applied to the avascular 
and the angiogenetic growth phase [2, 10, 12, 15, 28, 29]; for brief reviews on 
individual-cell based models see e.g. ref. [5, 14, 26]. 

Here, we focus on pattern formation in multicellular spheroids which have 
been proposed as experimental model system for the study of avascular tumor 
growth [16, 18, 19]. In vitro growing avascular tumors provide excellent test systems 
for mathematical models since they have been extensively studied experimentally 
and are accessible to all kinds of manipulations which allows experimental tests 
of model predictions. In a typical experiment, tumor cells are grown in culture 
or suspension and are repeatedly exposed to fresh nutrient solution. An initial 
exponential growth phase of the population size is either followed firstly by power- 
law-like growth (accompanied with a linear growth of the tumor diameter) [18, 
19], then saturation [16], or, directly by saturation [19], depending on the precise 
growth conditions. A cross-section of the tumor in a later stage exhibits a layered 
structure: a core zone composed mainly of necrotic material is surrounded by a 
thin layer of quiescent tumor cells and an outer ring of proliferating tumor cells 

(Fig- 1)- 
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In this contribution we present two complementary individual-cell based ap- 
proaches, namely an off-lattice and a lattice (cellular automaton) model. The off- 
lattice approach describes a cell similar to a Brownian particle capable of growth 
and division and allows to directly include biomechanical and kinetic cell param- 
eters. It is used here to provide a potential explanation of the growth kinetics in 
experimental findings of Preyer and Sutherland [18]. The hybrid lattice gas cellular 
automaton elucidates the growth saturation and self-organized formation of the 
layered structure in multicellular spheroids as shown by a direct comparison with 
the experimental results of Folkman and Hochberg [16]. Our simulations suggest 
a growth scenario of multicellular spheroids that consists of a sequence of differ- 
ent growth regimes, in which the length of each phase depends on the particular 
growth conditions. We argue that the experiments of Freyer and Sutherland [18] 
and of Folkman and Hochberg [16] which have been performed under different 
growth conditions show different parts of the growth sequence. 




days 



Figure 1. Folkman and Hochberg [16] studied the growth of iso- 
lated spheroids from V-79 Chinese hamster lung cells, repeatedly 
transfered to new medium, (a): a cross-section of a V-79 spher- 
oid is shown, 1.0 mm in diameter and 20 days old. Viable cells 
are labeled with ['^H]thymidine; (b): mean diameter and standard 
derivation of 70 isolated spheroids of V-79 cells (reproduced from 
[16]). 



2. Models 

2.1. Off-Lattice Approach 

Guided by the observation that isolated cells in cultures or suspensions often have 
a spherical shape the off-lattice (three-dimensional) model assumes that each cell 
is spherical directly after cell division and deforms into a dumbbell during mitosis 
(Fig. 2), i.e. actively changes its ’’equilibrium shape”. 
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Figure 2. Illustration of the cell division algorithm. During cell 
division, a cell deforms from a perfect sphere of radius R into a 
dumbbell. 



To each assembly of cells a (’’total”) energy is assigned summarizing 
nearest-neighbor interactions that result from the competition between attractive 
interactions due to adhesion molecules anchored in the cell membranes and repul- 
sive contributions firstly from the limited cell deformability and compressibility, 
secondly from the loss of membrane steric entropy. We model each cell as a ’’hard 
core” -particle with a soft, elastic, attracting shell of short range S R [13]. The 
hard-core mimics the excluded volume effect, which takes into account that cells 
cannot be arbitrarily compressed or deformed. The elastic, attractive shell ac- 
counts for the deform-ability of a cell and the existence of adhesion molecules in 
its membrane. If cells are to far apart (distance > 2R + S) they do not interact. 
Active cell deformations during mitosis (Fig. 2) cause a pressure on the neigh- 
bor cells into the direction of the deformation. This leads to an increase of the 
total energy We assume that the neighbor cells either move their (tenter of 

mass or change their orientation in order to minimize We further assume 

that (i) inertial terms are small compared to dissipative terms and (ii) processes 
not explicitly considered, such as the cell metabolism, intra-cellular movements of 
the cytoplasm and the reorganization of the cytoskeleton give rise to a stochastic 
component in the displacement of the cells. Accordingly, for isolated cells a purely 
random movement takes place. 

Although growth is intrinsically a non-equilibrium phenomenon, the dynam- 
ics has been modeled by the Metropolis method which corresponds to the nu- 
merical integration of a Master equation [11, 23, 24]. This is justified since after 
each growth step all cells attempt to relax at least into a local equilibrium con- 
figuration [11]. In the simulations, cells are randomly chosen to perform either a 
translation, rotation or growth trial with small step sizes. Each translation or ro- 
tation is accepted with probability Pa = 1, if — F/"^ < 0 and with 

probability Pa = exp(— AF^^Y^/’) AF^^^^ > 0 (hence isolated cells move diffu- 

sively in accordance with refs. [25, 27]); t is time. At the time duration between 
two migration or rotation trials. P/’ is an effective energy and the fc/^T-analogue 
in cellular systems (T: temperature, kjf. Boltzmann const.) [3]. A growth trial is 
accepted only if it does not result in too strong cell deformations. This has the 
consequence that sufficiently large external mechanical stress is able to trigger a 
stop of further tumor growth [21]. Between two growth trials a cell performs on 
the average many (^1) translation and rotation trials. A cell not subject to any 
excluded volume interactions (i.e. in the absence of hard core interactions) has 
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Figure 3. (a): Temporal development of the tumor diameter L 
(in iim) vs. time (in days) in the computer simulation (dashed 
line) and in the experiment from ref. [18]. (b): Snap shot of the 
tumor morphology in the computer simulation at 20000 

cells 3.t t ^ 9d. In the computer simulation we used parameters 
estimated from refs. [3, 6, 18] to r ^ 8.5/i, I ^ 18/im, Ft ~ 
2 X 10“ Z) 1.05 X 10 ~ ^^cm^/s and E ^ 343Pa. 



an average ’’intrinsic” cycle time r. If excluded volume interactions occur some 
growth trials are rejected (those which would lead to an overlap of the hard core 
of neighboring cells which would correspond to unbiologically large cell deforma- 
tions or compressions) and the real average cycle time becomes rn > r. In order to 
relate the parameter values to real physical quantities we introduce a length scale 
(the cell diameter immediately after mitosis), a time scale (r) and an energy scale 
(F/ ) as reference scales. The intrinsic cycle time r and the diffusion constant D 
can be expressed in terms of the model parameters. Furthermore, the shape of the 
potential can be expressed in terms of the geometric cell quantities and the Young 
modulus E (for details, see [13]). A cell subject to compression between two plates 
deforms as long as its diameter is still larger than its incompressible (hard) core. 
The size of the Young modulus determines the strength of the force neccessary for 
the deformation. 

Fig. 3a shows a typical growth curve obtained with this model in a computer 
simulation (which started with one initial cell, i.e. N = 1). Fig. 3b shows the 
corresponding morphology. The crossover from exponential to linear growth of the 
tumor diameter can be explained solely by a biomechanical origin. The line or 
argument is that cells during mitosis can only exert a force of limited strength on 
their neighbors. If the number of cells that must be pushed aside by a dividing 
cell in order to gain free space for its division exceeds a certain threshold the 
cell is not able to proceed in division anymore but stops in one of the cell cycle 
checkpoints. For the sufficiently small cell populations at small times t {t < 2d in 
Fig. 3a) none of the cells has a number of neighbors larger than the number it is 
able to push aside hence all cells are able to divide and the population increases 
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exponentially fast. Above a certain population size (of 100 — 1000 cells in the 
computer simulations corresponding to a tumor diameter of L = 100/im), cells in 
the interior of the tumor become incapable to exert a sufficiently large pressure 
on their neighbor cells in order to divide, hence cell proliferation becomes limited 
to a surface layer of finite size. This leads to a crossover to linear growth of the 
tumor diameter and a power law growth N (x for the number of cells. 

Although certainly a lack of nutrients or oxygen in the interior of the tumor 
is an alternative mechanism to account for a constant surface layer of proliferating 
cells, and, in combination with further processes able to explain the occurrence of 
saturation (see next section), biomechanical effects on growth have been experi- 
mentally observed. For example, Helmlinger et. al. [21] found that a tumor grows 
slower and growth saturation occurs at smaller tumor sizes if the external pressure 
on the tumor increases. Furthermore, Bru et. al. [4] observed a linearly growing 
tumor diameter in d = 2 dimensions in an experimental setting which suggests 
that no nutrient limitation occurs. Hence the crossover from exponential to linear 
growth is possible without nutrient limitation. 

2.2. Cellular Automaton Model 

If the nutrient or oxygen concentration in the growth medium is sufficiently small 
the growth of tumor spheroids is observed to saturate early [16], [19]. We pro- 
pose a hybrid cellular automaton model that gives a potential explanation for 
the crossover from linear growth to saturation of growth of the tumor diameter. 
The model is defined on a lattice and describes the dynamics of discrete cells (tu- 
mor cells (C) and necrotic “cells” (N)) and the diffusion of nutrient and signal 
molecules. The model formulation is in terms of a lattice-gas automaton on a two- 
dimensional square lattice, in particular with each lattice node (r = (ri,r 2 )) four 
velocity channels c\ = (1,0), = (0,1), = ( — l,0),c.i = (0,-1) and one “rest- 
ing” channel Cr, = (0,0) are associated. Accordingly, cells in the automaton are 
assigned an orientation. There is an exclusion principle, i.e. each channel can be 
occupied by at most one tumor or necrotic cell (Fig. 4). Nutrient is consumed by 
proliferating and quiescent tumor cells. Tumor cells burst if they become necrotic. 
It is assumed that thereby a necrotic signal leaks into the surrounding tissue that 
induces an up-gradient movement of tumor cells (cheniotaxis). The chemotactical 
assumption is motivated by the experimental observation that there is a significant 
number of cells which drift from the viable rim of spheroids towards the necrotic 
core [8, 9]. 

Cell kinetics is considered in a usual way (Fig. 5). According to these on-node 
rates tumor cells at a node either proliferate (i.e. divide, if unoccupied channels 
exist), remain quiescent, die or become necrotic. Nutrient is consumed by prolifer- 
ating and quiescent tumor cells at a rate {cmu)- Note that the presence of necrotic 
material at a node leads to complete inhibition of mitosis and might even act 
toxic for all tumor cells present at that node. All cells propagate simultaneously 
according to their orientation only cells residing in “rest channels” do not move. 
Redistribution of cells is defined by rules specifying adhesion, pressure (cells are 
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Figure 4. Example of a cell configuration at a lattice node r. 
The dark gray filled circle and the light gray filled circle denote 
the presence of a tumor and a necrotic “cell” , respectively. 




Figure 5. Examples of mitosis {pm), apoptosis {pd) and necrosis 
(pn) rates depending on the nutrient concentration {cmu) and the 
specific micro-environment (i.e. node configuration; n^s # tumor 
cells and nyy^ # necrotic cells at the node). 



pushed towards neighborhoods with low cell density) and chemotactic motility 
(tumor cells move into the direction of the maximal necrotic signal gradient). 

The model dynamics is schematically summarized in Figure 6. The complete 
model definition can be found in [10]. The automaton is scaled according to exper- 
imentally obtained values, particular with respect to typical tumor cell size, time 
scale and diffusion length (compare [10]). 

We have performed simulations of a d= 2-dimensional tumor cross section 
starting from a small number of active tumor cells and applying realistic parameter 
sets (Fig. 7). Parameters incorporated in the automaton rules are glucose uptake 
rate, critical glucose concentration and doubling times for V-79 cells [17, 22, 30]. 
In the simulations nutrient is periodically applied and the chemical signal simul- 
taneously removed (every hour) outside of the tumor, i.e. at nodes which have no 
tumor or necrotic cell material in their neighborhood. This mimicks the experi- 
mental conditions of [16]. Furthermore, the size of the lattice (200 x 200 nodes = 
8x8 mm) is chosen sufficiently large such that the boundaries do not influence 
the tumor growth within the considered time interval. The self-organized forma- 
tion of a layered pattern comprised of a central necrotic core, a rim of quiescent 
tumor cells and an outer thin ring of proliferating cells can be observed. After 
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Figure 6. Schematic representation of the model dynamics. Pa- 
rameters: n^c # tumor cells and nA^: # necrotic cells at a node; 

Cjiut'- nutrient and Csig'. signal concentration at a node, tnwU criti- 
cal nutrient concentration, Cmu' nutrient consumption of a tumor 
cell, c^ig'. signal production during necrosis. 

an initially (for times t < lOd) linear growth of the tumor diameter and almost 
quadratic growth of the number of cells (which in d=3 dimensions is expected to 
be cubic) this is accompanied by the increase of the necrotic “cell” population and 
simultaneous decrease of the tumor cell number (see Fig. 8b). Purely local rules 
(direct cell-cell interactions) allow for the transition from an initial small number 
of tumor cells to the final structured tumor. 

A necessary condition for growth saturation during avascular growth even 
in the case of periodic nutrient supply is to guarantee a tendency of tumor cell 
motion into the direction of the necrotic core. Otherwise the tumor would continue 
to expand until finally the tumor compound would break up as a result of necrotic 
material dissolution. The “antagonistic growth direction” can be induced by the 
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chemotactic migration of tumor cells into the direction of the maximum necrotic 
signal gradient. This mechanism produces a cell flow towards the center. Initially, 
the inward flow is small since the necrotic core is not existing or small. Accordingly, 
the out-moving cell population dominates, i.e. the tumor expands. But, later in 
development if the necrotic core has reached a critical size the inward flow takes 
over which limits further growth. 
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Figure 7. Simulation of tumor growth with the cellular automa- 
ton. A layered tumor hcis formed, comprised of necrotic “cell” ma- 
terial, quiescent and proliferating tumor cells. Parameters: initial 
tumor cell number: 44 (after 50 days the tumor cell number is 946, 
comp. Fig. 10), mitosis rate Pm = 0.05, apoptosis rate pd — 0.01, 
necrosis rate pn = 0.008, rate for N dissolution qd = 0.0005, pro- 
duction rate for chemical signal Csig = 1, decay rate for chemical 
signal 0.8, strength of chemical signal Sgtg = 0.8, lattice size 200 x 
200. 
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Figure 8. Evolution of (a) diameter and (b) cell number of 25 
tumor growth simulations with a necrotic signal included (C: tu- 
mor cells, N: necrotic “cells”). 

3. Discussion 

We presented two types of individual-based models for in vitro tumor growth. One 
is based on a Brownian-particle-like approach, the second on a hybrid lattice-gas 
cellular automaton. Both approaches are able to model quantitatively different 
stages in the development of avascular tumors. The results obtained in the com- 
puter simulations suggest that a growing avascular tumor passes through different 
growth regimes: (1) For small times it grows exponentially fast, since cell division 
is neither limited by a too low nutrient or oxygen concentration, nor by a too large 
compression of cells in the tumor interior. (2) For longer times the nutrient and/or 
oxygen concentration as well as biomechanical effects are able to limit cell prolifer- 
ation to a surface layer resulting in linear growth of the tumor diameter. (3) As the 
tumor exceeds a certain size necrosis can lead to growth saturation. The particular 
cell type and the experimental growth conditions may have an significant infiuence 
on the duration of each growth regime. This line of argument is supported by a 
series of experiments under different nutrient and oxygen conditions (see Freyer 
and Sutherland [19]). The experiments suggest that a sufficiently strong limitation 
of glucose and oxygen may reduce the length of the linear growth period so much 
that it may even not be experimentally observed anymore. The results for the 
Brownian particle-like approach were obtained in d = 3, for the lattice-gas cel- 
lular automaton approach in d = 2 dimensions. The qualitative growth behavior 
of the tumor diameter is expected to remain unaffected by this difference in the 
dimensionality. If one assumes the volume of the tumor grows approximately as 
V (X N before necrosis occurs, then N grows exponentially fast in the initial and 
^LS N X X in the intermediate regime (where L xt). On the other hand, if 
L saturates, N saturates as well. 

Further modeling steps could involve the analysis of potential treatment ef- 
fects on tumor growth and morphology [12, 13], and the interaction between the 
tumor cells and the immune system. We believe that both modeling approaches 
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in principle provide appropriate tools for such an analysis. The Brownian-particle- 
like approach operates completely lattice-free; hence geometric effects due to any 
lattice symmetries cannot occur. Kinetic and biomechanical parameters can be 
directly linked to the model parameters. 

The lattice-gas cellular automaton is “rule- based” . As in hydrodynamic lattice- 
gas automata, automaton rules are not designed primarily in order to provide a 
direct copy of the physical system at a particular (here: cellular) length scale. 
The ’’automaton philosophy” is to develop a simplified microscopic caricature of 
the natural system which however grasps the essential dynamics at the partic- 
ular length scale considered (compare [20]). The CPU-time requirement for the 
lattice-gas cellular automaton is much smaller than for the off-lattice Brownian 
particle-like approach (hence the former allows to proceed to larger system sizes). 

An important advantage to use individual-based approaches over locally av- 
eraged cell densities is that local properties of cells on small length scales, as the 
detachment of a single cell from the primary tumor that may precede metastasis 
formation, or (de-)differentiation and apoptosis (if only a small number of cells 
at special positions are concerned) can more easily and much more directly be 
represented in an individual-based approach. Furthermore, stochastic effects as in- 
trinsic noise due to the discreteness of the system and molecular processes within 
and outside the cell that appear as stochastic elements in the cell migration and 
division can naturally be included in an individual-based approach. While surface 
and bulk properties of growing tumors in different growth stages in most cases 
have to be considered separately by continuum equations (e.g. one for the position 
of the surface, one for the cell density) they are a natural outcome in the computer 
simulation with the individual-based approach. However, a closed formulation in 
terms of mathematical equations, if it exists, simplifies the identification of biolog- 
ical mechanisms while in computer simulations mechanisms are sometimes hidden 
behind complicated parameter dependencies which are difficult to analyze. Inso- 
far both model types, the individual-based approach and the local density-based 
approach are complementary and both can contribute to a better understanding 
of tumor growth and treatment. The final decision whether a proposed biological 
mechanism is present or not is clearly provided by the experiment. 
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1. Introduction 

The development of a primary solid tumour (e.g. a carcinoma) begins with a 
single normal cell becoming transformed as a result of mutations in certain key 
genes. This transformed cell differs from a normal one in several ways, one of the 
most notable being its escape from the body’s homeostatic mechanisms, leading 
to inappropriate proliferation. An individual tumour cell has the potential, over 
successive divisions, to develop into a cluster (or nodule) of tumour cells. Fur- 
ther growth and proliferation leads to the development of an avascular tumour 
consisting of approximately 10^’ cells. This cannot grow any further, owing to its 
dependence on diffusion as the only means of receiving nutrients and removing 
waste products. For any further development to occur the tumour must initiate 
angiogenesis-the recruitment of blood vessels. Once angiogenesis is complete, the 
blood network can supply the tumour with the nutrients it needs to grow further. 
There is now also the possibility of tumour cells finding their way into the cir- 
culation and being deposited at distant sites in the body, resulting in metastases 
(secondary tumours) . 

A crucial part of the invasive/metastatic process is the ability of the cancer 
cells to degrade the surrounding tissue or extracellular matrix (ECM) [12]. This is a 
complex mixture of macromolecules (MM), some of which, like the collagens, play 
a structural role and others (such as laminin, fibronectin and vitronectin) are im- 
portant for cell adhesion, spreading and motility. We note that all of these macro- 
molecules are bound within the tissue, i.e. they are non-diffusible. The ECM can 
also sequester growth factors and itself be degraded to release fragments which can 
have growth-promoting activity. Thus, while the ECM may have to be physically 
removed in order to allow a tumour to spread, its degradation may, in addition, 
have biological effects on tumour cells. 

A number of matrix degradative enzymes (MDEs) such as the plasminogen 
activator (PA) system and the large family of matrix metalloproteinase s (MMPs) 
have been described [15] and both of these have been repeatedly implicated in tu- 
mour invasion and metastasis. In addition to opening migratory pathways, MDEs 
can alter cell adhesion properties regulated through several classes of cell surface 
receptors. These receptors, including cadherins, CD-44, integrins, and receptors for 
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fibronectin, laminin, and vitronectin, negatively regulate cell motility and growth 
through cell-cell and cell-matrix interactions [12]. Therefore, proteolytic degrada- 
tion of receptor and/or ECM components could release tumour cells from these 
constraints. Recent studies have shown that CD-44 mediates the attachment of 
cells to various MM, in fact invasion of human glioma cells has been inhibited 
by antibodies against CD-44 [9]. Molecules which facilitate interactions between 
cells and between cells and the ECM, known as cell adhesion molecules, are now 
thought to be central to the invasive process [8]. Therefore it is important for 
any model that considers tumour invasion to include both cell-cell and cell-matrix 
interactions. 

Tumour heterogeneity at the genetic level is well known and the so called 
“Guardian of the Genome”, the p53 gene is widely considered as a precursor to 
much wider genetic variation [10]. The p53 protein links three cellular functions: 
proliferation, death and DNA repair. In normal cells, p53 blocks proliferation and 
enables damaged DNA to be repaired. If DNA repair is incomplete, apoptosis is 
initiated and the cell dies. Loss of p53 function (e.g. through mutation) allows 
for the propagation of damaged DNA to daughter cells [10]. As a step towards 
the inclusion of true tumour heterogeneity, we shall consider a tumour that has 
phenotypic heterogeneity, with p53 being the only specific gene considered. Its 
effect is to simply allow the genetic mutations to begin and to, of course, allow 
the tumour cell to survive. The tumour cell phenotype will be defined here by 
the level of the cell’s aggressiveness, i.e. a combination of its cell-cell adhesiveness, 
proliferation, degradation and migration rates (further details will be discussed 
below) . 

The aim of this chapter is to examine the effects of tumour cell heterogeneity 
upon the overall spatial structure of the tumour and to discuss the importance of 
the roles of cell-cell and cell-matrix interactions. 



1.1. The PDE Model of Invasion 

We will base our mathematical model on generic solid tumour growth, which we 
will assume has just been vascularised, i.e. a blood supply has been established. 
We choose to focus on four key variables involved in tumour cell invasion, thereby 
producing a minimal model, namely: tumour cell density (denoted by n), MDE 
concentration (denoted by m), MM concentration (denoted by /) and oxygen 
concentration (denoted by c). Initially we define a system of coupled nonlinear 
partial differential equations to model tumour invasion of surrounding tissue and 
use these as the basis for the Hybrid Discrete- Continuum (HDC) technique. 

The complete non-dimensional system of equations describing the interac- 
tions of the tumour cells, MM, MDEs and oxygen is 
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where dn, dm and dc denote the tumour cell, MDE and oxygen diffusion coefficients 
respectively, 7 denotes the haptotaxis coefficient and 77 , n, g, u, uj and (p are positive 
constants. Since this model represents a development of the work by Anderson et 
al. [2], we will not discuss its derivation here. However, some explanation should be 
given for the inclusion of oxygen in the model. Oxygen is assumed to diffuse into 
the MM, decay naturally and be consumed by the tumour. For simplicity oxygen 
production is proportional to the MM density. This is a crude way of modelling 
an angiogenic oxygen supply, see [1] for a more appropriate way of modelling the 
angiogenic network. We should also note that cell-matrix adhesion is modelled here 
by the use of haptotaxis in the cell equation, i.e. directed movement up gradients 
of MM (see [2] for more details). 



1.2. Parameterisation 

In order to use realistic parameter values we have already non-dimensionalised the 
equations (see [2] for more details). We have rescaled distance with an appropriate 
length scale L (e.g. the maximum invasion distance of the cancer cells at this early 
stage of invasion, approximately 1cm), time with r (e.g. the average time taken for 
mitosis to occur, approximately 8 — 24hrs [5]), tumour cell density with no, ECM 
density with /o, MDE concentration with mo and oxygen concentration with Co 
(where no,/o,mo,Co are appropriate reference variables). Note, in the following 
paragraph a capitalised variable represents the dimensional form of the lower case 
variable. 

The cell cycle time can be highly variable and really depends on the specific 
tumour we are dealing with, as a rough guide we take r = 16hrs [5]. We take 
the cell motility parameter Dn ~ 10~'Ami‘^s~^ [4]. The haptotactic parameter 
r ~ 2600cm^s”^M~^ [1] and the parameter /o ~ 10“^ — 10“‘^M [14]. We took Dm 
to be 10~'^cm'^s~\ which is perhaps small for a diffusing chemical, however, recent 
experimental evidence implies that it is, in fact, a combination of the MDE and 
MM which results in degradation of the MM and that this bound chemical diffuses 
very little [7]. Estimates for the other parameters were not available since these 
are difficult to obtain experimentally and we therefore use the values of Anderson 
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et al. [2]. Oxygen is known to diffuse through water at a rate of Dc = 

and cells consume oxygen at a rate of 6.25 x 10~^^Mcell“^s“^ [6], and we take 

Co = 6.7 X 10-^MO2cm-^^ [11]. 



1.3. The HDC Model 



The HDC technique (see [l]-[3]) will be used to follow the path of an individ- 
ual tumour cell and initially involves discretising (using standard finite-difference 
methods) the system of partial differential equations (1). We then use the resulting 
coefficients of the five-point finite-difference stencil to generate the probabilities of 
movement of an individual cell in response to its local milieu (see Appendix of [2], 
for the full discrete system). For clarity we only consider the tumour cell equation, 
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The coefficient Pq, which is proportional to the probability of no movement, has 
the form. 
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where the subscripts specify the location on the grid and the superscripts the time 
steps. That is x = ih, y — jh and t = qk where z, j, /c, q and h are positive 
parameters. 

The central assumption in the HDC technique is that the five coefficients Po 
to P4 are proportional to the probabilities of the tumour cell being stationary (Po) 
or moving left (Pi), right (P2), down (P3) or up (P4). Prom the above probabilities 
we see that if there were no MM the values of P] to P 4 would be equal, with Po 
smaller (or larger, depending on the precise values chosen for the space and time 
steps) i.e. there is no bias in any one direction and the tumour cell is less (more) 
likely to be stationary-approximating an unbiased random walk. However, if there 
are gradients in the MM, haptotaxis contributes to the migration process and 
the coefficients Po to P4 will become biased towards the direction of increased 
MM concentration. The motion of an individual cell is therefore governed by its 
interactions with the matrix macromolecules in its local environment. Of course 
the motion will also be modified by interactions with other tumour cells. 



1.4. Individual-Based Processes 

We shall now discuss in detail the processes each tumour cell will experience as 
it migrates through the MM, driven by the movement probabilities defined in the 
previous section. 
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Life Cycle: At each time step a tumour cell will initially check if it can move with 
regards to cell-cell adhesion restrictions (see the next paragraph for criteria), if it 
can then the movement probabilities (above) are calculated and the cell is moved. 
A check is then made if the cell should die (see death paragraph for criteria) or 
not. If it should not die, its age is increased and a check to see if it has reached 
proliferation age is performed. If it has not reached this age then it starts the 
whole loop again. If proliferation age has been reached then a check is made to see 
if the criteria for proliferation are satisfied (see proliferation paragraph for details). 
If proliferation criteria are not met then the cell becomes quiescent. If they are 
satisfied then we check to see if this mitosis results in a p53 mutation. If a p53 
mutation does not result then death occurs, if it does then further mutations are 
possible (see mutation paragraph for details). This whole process is repeated at 
each time step of the simulation. 

Cell- Cell Adhesion: To model cell-cell adhesion explicitly we assume each cell has 
its own internal adhesion value (Aj), i.e. the number of neighbours that it requires 
before it can migrate. We therefore examine the number of external neighbours 
each cell has (A(.) and if A<> > Aj then the cell is allowed to migrate, otherwise it 
remains stationary. Whilst this is a somewhat crude way of modelling cell adhesion, 
it nevertheless captures the essential feature of cell-cell adhesion mediated contact 
inhibition i.e. migration is restricted due to contact with neighbouring cells. 

Death: For the tumour cell to survive it requires sufficient oxygen. Since some tu- 
mour cells have been found to survive in very poorly oxygenated environments, we 
make the assumption that the concentration has to drop to 0.05 non-dimensional 
units (where 1 would be the initial concentration) for cell death to occur. This 
assumption is also applied to quiescent tumour cells. The space that dead cells 
occupy becomes available to new cells as soon as they die. Cell death can also 
occur if the cell does not undergo a p53 mutation after the first proliferation (we 
assume that each cell has a probability Ppr^:] = 0.1 of a p53 mutation). As with all 
the mutations considered here, they can only occur as a result of proliferation. 

Proliferation: In our model we assume that each individual cell has the capacity for 
proliferation and will produce two daughter cells provided: (i) the parent cell has 
reached maturity (Mhrs, see Table 1) and (ii) there is sufficient space surrounding 
the parent cell for the two new daughter cells to occupy. In order to satisfy condi- 
tion (ii), we assumed that one daughter cell replaces the parent cell and the other 
daughter cell will move to any one of the parent cell’s four orthogonal neighbours 
that is empty. If more than one of the neighbouring grid points is empty then the 
new cell position is chosen randomly from these points. If no empty neighbours 
exist then the cell becomes quiescent and proliferation is delayed until space be- 
comes available. Quiescent cells are assumed to be non- motile and consume half 
the oxygen of normal cells. 
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Phenotype 


Prolif Age M 


O 2 Uptake 


MDE Production 


Ai 


Haptotaxis 


I (orange) 


16hrs 


U) 


AC 


3 


7 


II (green) 


14hrs 


4/3w 


CO 


2 


4/37 


III (cyan) 


12hrs 


2uj 


2 ac 


1 


27 


IV (blue) 


8hrs 


4 w 


4 ac 


0 


47 



Table 1. Parameter values for each of the four different Pheno- 
types. Colours are used to identify the different phenotypes in the 
simulation results below. 



Mutation: Initially all cells are assumed to have wild-type p53 i.e. non-mutated. 
After a p53 mutation occurs the genome becomes unstable and is then open to 
many more mutations. So after the p53 mutation has occurred, the cell now has 
predefined phenotypic traits that specify its behaviour. For simplicity we consider 
four phenotypes, each progressively more aggressive (in terms of invasiveness) than 
before. Table 1 shows the different values each phenotype takes and clearly type IV 
is the most aggressive, having the shortest proliferation age, consuming the most 
oxygen, producing the most MDE, having the largest haptoaxtis coefficient and 
requiring no neighbours for migration. Following the first p53 mutation a cell is 
assigned the values of phenotype I (as defined in Table 1) and for each subsequent 
proliferation there is a small probability {P,nutat ) of further mutations occurring 
which will lead to phenotype II and so on in a linear fashion. All mutations are 
assumed to be irreversible. 

Production/ Degradation: Since we are modelling individual tumour cells we must 
consider MDE production at the level of a single cell. In the continuum model 
(1) we have MDE production as being proportional to the tumour cell density. 
Now MDE is only produced at a grid point if a tumour cell is occupying that 
grid point. Since we have no precise parameter estimates for this production rate, 
we take n = 1 in the discrete form of the MDE equation when a tumour cell is 
occupying the current location and take n = 0 otherwise. Similarly for oxygen 
uptake, we take n = 1 (since uj is scaled as per cell) in the discrete form of the 
oxygen equation when a cell is consuming oxygen at the current location and n = 0 
otherwise. 

1.5. HDC Model Simulation Results and Discussion 

The following simulations were carried out on a 400 x 400 grid, which is a dis- 
cretisation of the unit square, [0, 1] x [0, 1], with a space step of h = 0.0025 and 
a time step of /c = 0.0005. No flux boundary conditions were imposed on the 
square grid, restricting the tumour cells, MDE, MM and oxygen to within the 
grid. Initially, 50 tumour cells are centred around (0.5, 0.5) with a random age 
between 0/irs - 16/irs, the MDE concentration was zero throughout the domain 
(m{x,y) = 0) and the oxygen concentration was taken to be one {c{x,y) = 1). 
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FiCiURE 1 . Homogeneous tumour results, spatial distribution of 
tumour cells, MDE, MM and oxygen (clockwise) at time t = 200 
units (i.e. 200 generations, approximately 133 days). (For colored 
picture and more detailed explanation see color plate 13). 



We consider the effects upon tumour invasion, of three different MM initial dis- 
tributions: (i) homogeneous {f{x,y) = 1 ), (ii) heterogenous (0 < f{x,y) < 1 ) and 
(iii) random (0 < f{x,y) < 1). For clarity we shall label the resulting tumour 
cell distributions as (i) homogeneous tumour, (ii) heterogeneous tumour and (iii) 
random tumour. The non-dimensional parameter values used in all the following 
simulations are d,, = 0.0005, = 0.0005, = 0.5, 7 = 0.01, ry = 50, /^ = 1, 

cr = 0 , E = 0.5, uj = 0.57 and 0 = 0.025. We also take the phenotype mutation 
probability to be Pjmitat =0.1. Other values were considered and produced similar 
results but for shorter or longer times depending on whether the probability was 
smaller or larger. 

Figure 1 shows the simulation results for each of the four variables for the 
the homogeneous initial MM distribution at ^ = 200 time units. The tumour cell 
distribution shows a mainly dead central region with a thin dispersed proliferating 
boundary. As might be expected, it is the most aggressive cells of phenotype IV 
which have survived and continue to proliferate on the boundary, although a small 
cluster of cells has survived in the centre of the tumour. This is partly due to the 
fact that quiescent cells consume less oxygen and therefore allow a little oxygen to 
diffuse back into the centre of the tumour. Given more time, these cells will also 
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Figure 2. Heterogeneous tumour results, spatial distribution of 
tumour cells, MDE, MM and oxygen (clockwise) at time t = 200 
units. (For colored picture and more detailed explanation see color 
plate 14). 



die due to lack of oxygen. However, this does imply that even necrotic regions may 
still offer some potential for tumour cell survival (in the short term). Since the cells 
were invading through an initially homogeneous distribution of MM, it is perhaps 
not too surprising that this has produced the most invasive and symmetric tumour 
(this symmetry is also seen in all three of the other variables). The homogeneous 
tumour has also produced the largest number of individual cells, this is due to the 
faster invasion rate which gives access to empty space for proliferation leading to 
further invasion. The faster invasion is mainly driven by the cell-matrix interactions 
via haptotaxis, giving directed motion towards higher concentrations of MM. Since 
the type IV cells are on the boundary they exploit this gradient the most by having 
the largest haptotaxis coefficient and no cell-cell adhesion dependence. 

Figure 2 shows the simulation results for an initially heterogeneous MM dis- 
tribution. The tumour cell distribution is now radically different geometrically, 
showing a more fingered morphology, although it still consists of just two cell 
types, dead cells and type IV cells. Again the most aggressive cells dominate the 
population and lead the way for invasion at the boundary of the tumour. Since 
these cells have no cell-cell adhesion dependence, their migration is mainly driven 
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Ficjure 3. Random tumour results, spatial distribution of tu- 
mour cells, MDE, MM and oxygen (clockwise) at time t = 200 
units. (For colored picture and more detailed explanation see color 
plate 14). 



by haptotaxis via the local MM gradients and it is these local gradients that ulti- 
mately define the tumour geometry. Since the initial MM distribution is somewhat 
artificially smooth in Fig. 1 and 2, we finally consider a completely random initial 
MM distribution in Fig. 3. Figure 3 shows that the tumour cell distribution again 
displays the fingered morphology, in a slightly more symmetric manner than the 
heterogeneous tumour. We also see that the tumour consists of dead cells and type 
IV cells, again leading the invading tumour boundary. 

Given that all three simulations use the same parameters, with the exception 
of the MM initial distributions, these results illustrate the importance of tumour 
cell-matrix interactions in aiding or hindering the migration of individual cells 
that define the tumour geometry. This result is dependent on the fact that the 
tumour cells mutate quickly enough to the most aggressive phenotype in order 
to exploit the changes in local MM gradients. If the cell-cell adhesion parameter 
were to be fixed for all phenotypes (e.g. Ai = 3), then roughly the same small 
circular tumour would result for all three MM distributions. However, it is known 
that tumour cells lose the ability (via mutations) to recognise cell-cell adhesion 
molecules at an early stage in their development e.g. E-cadherin is lost at an early 
stage of breast carcinogenesis and N-CAM loss in gliomas is associated with a high 
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probability of metastasis [13]. Therefore the results from the random tumour (Fig. 
3) perhaps represent the most realistic simulation, where the bias from the cell- 
matrix interactions is purely driven by local degradation of MM and subsequent 
creation of gradients which lead to directed migration of the tumour cells. The 
fact that the resulting tumour cell population consists of only one living cell type 
IV might not be surprising due to the linear nature of the mutations. However, 
it seems logical to assume that it will be the most aggressive tumour cells that 
dominate the tumour population. 

In conclusion, whilst cell-cell interactions are important at the early stages 
of the tumour’s development, subsequent loss of cell-cell adhesion molecules (via 
mutation) results in a tumour that is dominated by cell-matrix interactions. There- 
fore, these results predict that local tumour cell-matrix interactions are ultimately 
what define the overall geometry of the tumour and not the cell-cell interactions. 

Much more work needs to be done in examining the range of behaviour that 
this model can display. In particular the manner in which oxygen production is 
modelled needs to be refined perhaps by modelling more accurately the angio- 
genic network. Also the inclusion of a purely random phenotype would not only 
remove the linearity of the mutations but produce a more realistic model. There 
is certainly a need for a sensitivity analysis of the parameters, especially the mu- 
tation probabilities and the production/uptake parameters that are not known. 
Nonetheless, the results presented above show the importance of considering both 
cell-cell and cell-matrix interactions in a model of tumour invasion. Furthermore 
the strong dependence of the tumour geometry on local cell-matrix interactions 
may point the way for future cancer research and treatment. 
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1. Cancer, a Product of Somatic Evolution 

Cancers are genetic diseases and, although there may be elements of heritable pre- 
disposition as in some cases of colon carcinoma [1,2], essentially determined by 
somatic genetics. Varying numbers of chromosomal aberrations cts well as muta- 
tions in genes considered crucial for malignant transformation have been correlated 
with tumour development. However, a one-to-one correlation of a particular ge- 
netic abnormality translating into a malignant cell physiology with a particular 
tumour type, as it is the case for the bcr-abl translocation in CML and some acute 
leukemias, is the exception rather than the rule. Departing from the earlier single 
hit genetic concepts, cancers are now generally seen as the result of a number of ge- 
netic alteration that affect various cellular functions. As Weinberg and co-workers 
have argued for many years, there are three essential events that convert a normal 
cell into a tumour cell [3,4]. These essentials affect growth control which are the 
oncogene function, tumour suppressor gene function and telomerase activity. The 
initial models proposed by Vogelstein [5] for the ontology of colon carcinoma and 
subsequently by Isaacson [6, 7] for MALT lymphoma have postulated a defined 
succession of genetic alteration. However, as a result of intense research on cancer 
genetics and despite of recurring themes of oncogene functions and tumour sup- 
pressor gene defects, we are now faced with a picture of considerable heterogeneity 
of the factors, events, succession of events and paths that lead to malignant trans- 
formation and control malignant development. Mutations that distinguish tumour 
cells from their parental cell types have been numbered to be in the order of hun- 
dreds of thousands [8,9] and often tumour progression is found to go along with 
increasing genetic instability [10 12]. To accommodate this diversity in the genet- 
ics of malignant transformation an increasing number of researchers in the field 
interpret cancer as the product of an often continuous somatic evolution. In this 
context, Tsao and co-workers [13] have re-evaluated the genetics of the develop- 
ment of colon carcinoma and present clear evidence for early diversification and 
co-evolution of tumour cell lines of different malignancy. 

As for the evolution of species [14], diversification, isolation or compartmen- 
talization and selection are expected, and found, to be guiding principles in the 
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ontology and the development of malignant diseases. Diversity of the tumour 
cells in cancer has been reported for the morphology, histology, cytogenetics, 
molecular genetics, phenotypes, gene expression profiles and cell biology of the 
tumours. Immune-histochemical analyses done in many laboratories for tumour- 
associated antigens and cell surface proteins commonly reveal substantial variation 
between different tumour lesion within the same patient or even for different cells 
of the same lesion [15-21]. The fraction of tumour cells that expresses a particular 
tumour- associated antigen rarely exceeds fifty percent. 



2. Immune Surveillance of Tumours 

Burnet [22] and many others have long pointed out that multi-cellular, multi-organ 
species necessarily require instruments to detect and eliminate tumours. We now 
know of tumour suppressive mechanisms at the cellular level such as the products 
of tumour suppressor genes [23], at the tissue level such as angiogenesis-controlling 
and tissue-specific surviving factors [24], and at the level of the entire body which 
are constituents of the immune system [25,26]. Already in 1971, Burnet had iden- 
tified the T-lymphocytes as the crucial effector cells of tumour-specific immune 
responses [22]. But only much lafer, in 1975, the gene products of the MHC were 
found to control antigen recognition by T-lymphocytes [27] and it took some 15 
years more until the molecular mechanisms of MHC-dependent antigen presen- 
tation and recognition were understood [28,29]. MHC molecules are cell surface 
proteins that bind peptides generated inside the cells from protein antigens by 
limited proteolysis [30]. The resulting complexes are the ligands of the antigen 
receptors of T-lymphocytes. These peptides or T-cell epitopes are usually derived 
from proteins produced by the cell. The MHC-dependent mode of antigen recogni- 
tion, thus, enables the immune system to sense molecular alterations inside cells. 
The first tumour-cissociated antigens and respective T-cell epitopes were identified 
in the late eighties and early nineties [31], about 20 years after Burnet published 
his immune surveillance hypothesis [25]. To date about 250 epitopes are known 
representing about 60 antigens [32], which are tumour-specific mutations, tumour- 
testis antigens, viral antigens and, most frequently, differentiation antigens, mean- 
ing, they are expressed in the tumour cells as well a>s in normal, non-transformed 
cells of the tumour’s histotype. Anti-tumour immune responses, thus, are often 
auto-reactive immune responses and vitiligo, sometimes seen in patients with ma- 
lignant melanoma, is an indication of such a tumour-aissociated auto-immunity [33] . 
However, vitiligo is rare and, when it occurs, restricted to individual foci at the 
patients skin. It appears that the T-lymphocytes can discriminate between a T-cell 
epitope presented by tumours cells or by a normal cell. It is still unknown how 
this discrimination is achieved. 
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3. Immune Evasion by Tumour Cells 

Although there is accumulating evidence that these immune reactions can detect 
and eliminate transformed cells and may prevent cancer right at the on-set, man- 
ifest cancer testifies to the failure of immune surveillance as well as to the failure 
of the other surveillance systems of the body. Tumour cells in cases of cancer are 
selected for survival and adapted to escape the impact of selection processes, be 
they inborn such as the action of tumour suppressor gene products and of the 
immune system or the result of therapeutic intervention like chemotherapy or im- 
mune therapy. As a result, tumours are expected, and have been shown, to bear the 
marks of these selection processes. The most striking examples of such selection 
processes are seen in recurrences after a successful chemotherapy, radiotherapy or 
immune therapy when the new variants of the tumour cells are now resistant to 
the very principle of the therapeutic intervention, i. e. they became resistant to 
chemotherapy through high expression of multi-drug resistant genes which code 
for a membrane transporter that indiscriminately shuffles drugs out of the cells [34] 
or, in the case of radiotherapy, they display a high degree of radioresistance. Im- 
mune evasion is particularly differentiated and can include loss of antigens [35,36] 
or of MHC molecules [37] that control antigen presentation and/or the expression 
of immune-suppressive factors such as IL-10 [38] and MIF [39]. In the course of 
cancer such immune evasion processes become increasingly prominent and may 
have systemic effects. 



4. Potentials and Limits of Cancer Immime Therapy 

Although the evasion mechanisms may be directly correlated with the principle 
of action of a natural anti-tumour response or of the therapy applied to the pa- 
tient, the evolution-biological concept of malignant development would see them 
as result of selection processes imposed on an already diversified tumour cell pop- 
ulation. By virtue of its specificity, and capacity to quickly adapt to new antigens 
and to expand in order to mount effective cytolytic responses the cellular arm of 
the immune system, of all the surveillance systems of the body, appears to be best 
suited to handle the diversity and adaptability of tumours [40] . There are indeed a 
number of observations that bear evidence of the existence and anti-tumour activ- 
ity of tumour-specific cytolytic T-lymphocytes in cancer patients. Such cells have 
been demonstrated both in the peripheral blood and in tumours [41-45]. They 
can be isolated and expanded in culture to be used to study their anti-tumour 
specificity and effector capacities. Anti-tumour vaccination W 2 is shown to induce 
mobilisation of such cells in the body and re-localisation to tumour lesions [46]. 
Tumours are usually infiltrated by lymphocytes. In fact, in some tumours such 
as early and intermediate stage cutaneous T-cell lymphomas, tumour-infiltrating 
lymphocytes can make up the bulk of the tumour mass [47]. A large fraction of 
these tumour-infiltrating lymphocytes are tumour-specific and can be expanded in 
culture, re-infused into the patients for therapeutic purpose [48]. It was shown that 
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these cells can persist in the peripheral blood for extended periods and home to the 
tumour sites. Moreover, the instruments of the immune system are not restricted 
to cytotoxic T-lymphocytes. Antigen loss, for instance, can render a tumour cell 
vulnerable to attacks by natural killer cells which respond to the lack of MHC ex- 
pression [49]. All this evidence for anti-tumour immune activity in cancer patients 
may go along with evidence for immune evasion and immune suppression so that 
the interrelationship of tumour and immune system appears to be characterised by 
processes of mutual adaptation. It is a dynamic relationship with shifting balances. 
There are observations of spontaneous remissions caused by immune reactions or 
of complete responses to tumour-specific vaccinations [50,51], but all too often the 
balance tips in favour of the tumour. Increasing immune suppression and failing 
anti-tumour immune responses usually correlate with tumour progression [38]. In 
malignant melanoma this battle between tumour and immune system can in some 
cases be seen at the primary tumour with the darkly pigmented tumour showing 
signs of active growth at the border zones surrounding areas of tumour regression 
evidenced by de-pigmentation areas. 



5. The Need for Combination Therapies for the Treatment of 
Cancer 

As in the evolution of species, tumours may become extinct when their propagation 
is not in balance with the resources of their environment, when the environmental 
conditions change to become less permissive or when they are eliminated by se- 
lection [14]. On the other hand, tumour cells in manifest cancer are expected, and 
have been shown, to be selected for survival, and adapted to escape the various 
surveillance systems of the body. The evolution-biological concept of cancer implies 
that whatever the current appearance and genetics of the malignancy is, it might 
just represent the majority variant [13]. Subsequent developments might sprout 
off obscure minority clones or variation of the dominant ones. If proven correct, 
this concept would call for new therapeutic strategies that address the potential 
heterogeneity of the tumour cells by combining different therapeutic principles. 
With the dominance of surgery, chemotherapy and radiation therapy, the basic 
concepts of cancer therapy have not changed during the past 50 years although 
there have of course been impressive improvements in all these three directions. 
Immunotherapy is being systematically explored only since a few years but now 
already it is clear that immunology will not provide all the answers to the cancer 
problem. Future, more effective therapies for cancers might have to be based on an 
understanding of the somatic evolution of cancer and the interplay of the factors 
leading to diversification and those that exert selective pressure. 



References 

[1] Vogelstein B. Cancer. A deadly inheritance, Nature, 348 (1990), 681-2. 




Interrelationship of Tumour and Immune System 



395 



[2] Cannon-Albright LA, Skolnick MH, Bishop DT, Lee RG, Burt RW, Common in- 
heritance of susceptibility to colonic adenomatous polyps and associated colorectal 
cancers, N Engl J Med, 319 (1988), 533-7. 

[3] Hahn WC, Counter CM, Lundberg AS, Beijersbergen RL, Brooks MW, Weinberg 
RA, Creation of human tumour cells with defined genetic elements, Nature, 400 
(1999), 464-8. 

[4] Hahn WC, Weinberg RA, Modelling the molecular circuitry of cancer, Nat Rev Can- 
cer, 2 (2002), 331-41. 

[5] Fearon ER, Vogelstein B, A genetic model for colorectal tumorigenesis. Cell, 61 
(1990), 759-67. 

[6] Wotherspoon AC, Pan LX, Diss TC, Isaacson PC, A genotypic study of low grade B- 
cell lymphomas, including lymphomas of mucosa associated lymphoid tissue (MALT), 
J Pathol, 162 (1990), 135-40. 

[7] Isaacson PC, Lymphomas of mucosa- associated lymphoid tissue (MALT), 
Histopathology, 16 (1990), 617-9. 

[8] Stoler DL, Chen N, Basik M, Kahlenberg MS, Rodriguez-Bigas MA, Petrelli NJ, 
Anderson GR, The onset and extent of genomic instability in sporadic colorectal 
tumor progression, Proc Natl Acad Sci USA, 96(1999), 15121-6. 

[9] Boland CR, Ricciardiello L, How many mutations does it take to make a tumor? 
Proc Natl Acad Sci U S A, 96 (1999), 14675-7. 

[10] Lengauer C, Kinzler KW, Vogelstein B, DNA rnethylation and genetic instability in 
colorectal cancer cells, Proc Natl Acad Sci U S A, 94 (1997), 2545-50. 

[11] Loeb LA, Micro satellite instability: marker of a mutator phenotype in cancer, Cancer 
Res, 54 (1994), 5059-63. 

[12] Thestrup-Pedersen K, Kaltoft K, G enotraumatic T cells and cutaneous T-cell lym- 
phoma. A causal relationship? Arch Dermatol Res, 287 (1994), 97-101. 

[13] Tsao JL, Tavare S, Salovaara R, Jass JR, Aaltonen LA, Shibata D, Colorectal ade- 
noma and cancer divergence. Evidence of multilineage progression. Am J Pathol, 154 
(1999), 1815-24. 

[14] Mayr, E. Change of genetic environment and evolution. In Huxley, J. Hardy, AC, 
Ford EB, eds.. Evolution as a process. 1954. 157-180. George Allen and Unwin Ltd., 
London. 

[15] Mocellin S, Fetsch P, Abati A, Phan GQ, Wang E, Provenzano M, Stroncek D, 
Rosenberg SA, Marincola FM, Laser scanning cytometry evaluation of MART-1, 
gplOO, and HLA-A2 expression in melanoma metastases. J Imniunother, 24 (2001), 
447-58. 

[16] de Vries TJ, Smeets M, de Graaf R, Hou- Jensen K, Brocker EB, Renard N, Egger- 
mont AM, van Muijen GN, Ruiter DJ. Expression of gplOO, MART-1, tyrosinase, 
and SI 00 in paraffin- embedded primary melanomas and locoregional, lymph node, 
and visceral metastases: implications for diagnosis and immunotherapy. A study con- 
ducted by the EORTC Melanoma Cooperative Group, J Pathol, 193 (2001), 13-20. 

[17] Orosz Z, M elan- A/ Mart- 1 expression in various melanocytic lesions and in non- 
melanocytic soft tissue tumours, Histopathology, 34 (1999), 517-25. 

[18] Pierard-Pranchimont C, Letawe C, Nikkels AF, Pierard GE. Patterns of the immuno- 
histochemical expression of melanoma- associated antigens and density of CD45R0P 




396 



P. Walden 



activated T lymphocytes and LI -protein positive macrophages in primary cutaneous 
melanomas. Int J Mol Med. 1998; 2, 721-4. 

[19] Cormier JN, Hijazi YM, Abati A, Fetsch P, Bettinotti M, Steinberg SM, Rosenberg 
SA, Marincola FM, Heterogeneous expression of melanoma- associated antigens and 
HLA-A2 in metastatic melanoma in vivo, Int J Cancer, 75 (1998), 517-24. 

[20] Kageshita T, Kawakami Y, Hirai S, Ono T. Differential expression of MART-1 in 
primary and metastatic melanoma lesions, J Immunother, 20 (1997), 460-5. 

[21] de Vries TJ, Fourkour A, Wobbes T, Verkroost G, Ruiter DJ, van Muijen GN Het- 
erogeneous expression of immunotherapy candidate proteins gplOO, MART-1, and 
tyrosinase in human melanoma cell lines and in human melanocytic lesions. Cancer 
Res, 57 (1997), 3223-9. 

[22] Burnet FM, Immunological surveillance in neoplasia. Transplant Rev, 7 (1971), 3-25. 

[23] Krug U, Ganser A, Koeffler HP, Tumor suppressor genes in normal and malignant 
hematopoiesis. Oncogene, 21 (2002), 3475-95. 

[24] Raff MC, Social controls on cell survival and cell death. Nature, 356 (1992), 397-400. 

[25] Burnet FM, The concept of immunological surveillance. Prog Exp Tumor Res, 13 
(1970), 1-27. 

[26] London NJ, Farmery SM, Will EJ, Davison AM, Lodge JP, Risk of neoplasia in renal 
transplant patients. Lancet, 346 (1995), 403-6. 

[27] Zinkernagel RM, Doherty PC, H-2 cornpatability requirement for T- cell-mediated ly- 
sis of target cells infected with lymphocytic choriomeningitis virus. Different cytotoxic 
T-cell specificities are associated with structures coded for in H-2K or H-2D, J Exp 
Med, 141 (1975), 1427-36. 

[28] Falk K, Rotzschke O, Stevanovic S, Jung G, Rammensee HG, Allele- specific mo- 
tifs revealed by sequencing of self-peptides eluted from MHC molecules, Nature, 351 
(1991), 290-6. 

[29] Bjorkman PJ, Saper MA, Samraoui B, Bennett WS, Strominger JL, Wiley DC, 
Structure of the human class I histocompatibility antigen, HLA-A2, Nature, 329 
(1987), 506-12. 

[30] Kloetzel PM, Antigen processing by the proteasome, Nat Rev Mol Cell Biol, 2 (2001), 
179-87. 

[31] van der Bruggen P, Traversari C, Chomez P, Lurquin C, De Plaen E, Van den 
Eynde B, Knuth A, Boon T, A gene encoding an antigen recognized by cytolytic T 
lymphocytes on a human melanoma. Science, 254 (1991), 1643-7. 

[32] Renkvist N, Castelli C, Robbins PF, Parmiani G, A listing of human tumor antigens 
recognized by T cells. Cancer Immunol Immunother, 50 (2001), 3-15. 

[33] Rosenberg SA, White DE, Vitiligo in patients with melanoma: normal tissue antigens 
can be targets for cancer immunotherapy, J Immunother Empheisis Tumor Immunol, 
19 (1996), 81-4. 

[34] Dietel M, Molecular mechanisms and possibilities of overcoming drug resistance in 
gastrointestinal tumors. Recent Results Cancer Res, 142 (1996), 89-101. 

[35] Geertsen R, Hofbauer G, Kamarashev J, Yue FY, Dummer R, Immune escape mech- 
anisms in malignant melanoma, Int J Mol Med, 3 (1999), 49-57. 




Interrelationship of Tumour and Immune System 



397 



[361 Pawelec G, Zeuthen J, Kiessline R, Escape from host- antitumor immunity, Grit Rev 
Oncog, 8 (1997), 111-41. 

[37] Seliger B, Maeurer MJ, Ferrone S, Antigen-processing machinery breakdown and 
tumor growth, Immunol Today, 21 (2000), 455-64. 

[38] Asadullah K, Docke WD, Volk HD, Sterry W, Cytokines and cutaneous T-cell lym- 
phomas, Exp Dermatol, 7 (1998), 314-20. 

[39] Mitchell RA, Bucala R, Tumor growth-promoting properties of macrophage migration 
inhibitory factor (MIF), Semin Cancer Biol, 10 (2000), 359-66. 

[40] Burnet FM, Implications of immunological surveillance for cancer therapy, Isr J Med 
Sci, 7 (1971), 9-16. 

[41] Balch CM, Riley LB, Bae YJ, Salmeron MA, Platsoucas CD, von Eschenbach A, 
Itoh K, Patterns of human tumor-infiltrating lymphocytes in 120 human cancers, 
Arch Surg, 125 (1990), 200-5. 

[42] Clemente CG, Mihm MC Jr, Bufalino R, Zurrida S, Collini P, Cascinelli N, Prog- 
nostic value of tumor infiltrating lymphocytes in the vertical growth phase of primary 
cutaneous melanoma, Cancer, 77 (1996), 1303-10. 

[43] Speiser DE, Lienard D, Pittet MJ, Batard P, Rimoldi D, Guillaume P, Cerottini JC, 
Romero P In vivo activation of melanoma- specific CD8(-h) T cells by endogenous 
tumor antigen and peptide vaccines, A comparison to virus-specific T cells. Eur J 
Immunol, 32(2002), 731-41. 

[44] Valmori D, Dutoit V, Lienard D, Rimoldi D, Pittet MJ, Champagne P, Ellefsen K, 
Sahin U, Speiser D, Lejeune F, Cerottini JC, Romero P, Naturally occurring human 
lymphocyte antigen-A2 restricted COST T-cell response to the cancer testis antigen 
NY-ESO-1 in melanoma patients. Cancer Res, 60 (2000), 4499-506. 

[45] Pittet MJ, Valmori D, Dunbar PR, Speiser DE, Lienard D, Lejeune F, Fleischhauer 
K, Cerundolo V, Cerottini JC, Romero P, High frequencies of naive Melan-A/MART- 
1 -specific CDS(-f-) T cells in a large proportion of human histocompatibility leukocyte 
antigen (HLA)-A2 individuals, J Exp Med, 190 (1999), 705-15. 

[46] Fisher B, Packard BS, Read EJ, Carrasquillo JA, Carter CS, Topalian SL, Yang 
JC, Yolles P, Larson SM, Rosenberg SA Tumor localization of adoptively trans- 
ferred indium-111 labeled tumor infiltrating lymphocytes in patients with metastatic 
melanoma, J Clin Oncol, 7 (1989), 250-61. 

[47] Jahn S, Asadullah K, Walden P, Sterry W, Cutaneous malignant lymphomas, Im- 
munol Today, 19 (1998), 100-3. 

[48] Rosenberg SA, Packard BS, Aebersold PM, Solomon D, Topalian SL, Toy ST, Si- 
mon P, Lotze MT, Yang JC, Seipp CA, et al. Use of tumor- infiltrating lymphocytes 
and interleukin-2 in the immunotherapy of patients with metastatic melanoma, A 
preliminary report. N Engl J Med, 319 (1988), 1676-80. 

[49] Moretta A, Bottino C, Vitale M, Pende D, Cantoni C, Mingari MC, Biassoni R, 
Moretta L, Activating receptors and coreceptors involved in human natural killer 
cell-mediated cytolysis, Annu Rev Immunol, 19 (2001), 197-223. 

[50] Kugler A, Stuhler G, Walden P, Zoller G, Zobywalski A, Brossart P, Trefzer U, 
Ullrich S, Muller CA, Becker V, Gross AJ, Hemmerlein B, Kanz L, Muller GA, 
Ringert RH, Regression of human metastatic renal cell carcinoma after vaccination 
with tumor cell- dendritic cell hybrids, Nat Med, 6 (2000), 332-6. 




398 



P. Walden 



[51] Trefzer U, Weingart G, Chen Y, Herberth G, Adrian K, Winter H, Audring H, Guo 
Y, Sterry W, Walden P, Hybrid cell vaccination for cancer immune therapy: first 
clinical trial with metastatic melanoma, Int J Cancer, 85 (2000), 618-26. 




Function and Regulation of Cellular Systems: 

Experiments and Models, 399-410 

ed. by A. Deutsch, M. Falcke, J. Howard, W. Zimmermann 
© 2004 Birkhauser Verlag Basel/Switzerland 



Nonlinear Dynamics and Statistical Physics of 
Models for the Immune System 

Ulrich Behn^, Markus Brede^ and Jan Richter^’^ 

^ Institute for Theoretical Physics, and 

^ Institute for Clinical Immunology, University of Leipzig, P.O.B. 100920, D-04009 Leipzig, 
Germany 



1. Introduction 

The immune system is a hierarchically organized natural adaptive system built 
by a macroscopic number of constituents which shows a very complex behaviour 
on several scales of temporal, spatial, and functional organization. In this paper 
we try to demonstrate the possible use of methods from nonlinear dynamics and 
statistical physics in understanding immunological problems of different levels of 
complexity, cf. also [1,2]. To this aim we consider two examples. 

The first example is Thl/Th2 regulation and allergy. T-helper lymphocytes 
have subtypes which differ in their spectrum of secreted cytokines. These cytokines 
have autocrine effects on the own subtype and cross-suppressive effects on the 
other subtype and regulate further the type of immunoglobulines secreted by B- 
lymphocytes. The balance of Thl- and Th2-cells is perturbed in several diseases. 
For example, in allergy the response to allergen is Th2-dominated. A widespread 
and successful therapy consists in the injection of increasing doses of allergen 
following empirically justified protocols of administration. We seek an explanation 
of this therapy studying the nonlinear dynamics of a nonautonomous system of 
few variables which describes the Thl/Th2 populations in the sense of a mean 
field theory. Indeed, the system is driven by proper injections of allergen towards 
new attractors, where the response is Thl-dominated as for healthy individuals. 
The target of the initial phase of the therapy with increasing doses of allergen is 
in our view not primarily the T-cell system, but it is to desensitize mast cells and 
basophils, so that the larger doses during the maintenance phase of the therapy do 
not cause allergic symptoms. This is supported by a model describing the dynamics 
of the mast cell stimulation and the Calcium response inside the cell that triggers 
the release of inflammatory mediators. 

Our second example is the idiotypic network of B-lymphocytes. B-cells ex- 
press on their surface receptors (antibodies) of a given specificity (idiotype). Cross- 
linking these receptors by complementary structures (antigen or antibodies) stim- 
ulates the lymphocyte to proliferate. Thus even without antigen there is a large 
functional network of interacting lymphocytes, the idiotypic network. Both the 
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Figure 1. Thl/Th2 regulation. Allergen presented by ABC’s 
stimulates naive cells to differentiate into Thl- or Th2-cells which 
differ in the spectrum of cytokines they secrete. The cytokines of 
a given T-helper subtype have autocrine effects on their own type 
and cross-suppressive effects on the other type. Cytokines of Thl- 
cells (Th2-cells) stimulate B-cells to produce IgC4 (IgE). Allergy 
is associated with a Th2 dominated response. 



potential repertoire and the number of idiotypes expressed by an individual at 
a given time (the expressed repertoire) are of macroscopic order. The study of 
the idiotypic network is thus clearly a playground for statistical physics. In the 
frame of a simple bit-string model we investigate the architecture of a randomly 
generated idiotypic network. We identify a working regime above the percolation 
transition where a giant cluster coexists with many small clusters, chosen such that 
immunological demands as the completeness of the repertoire and a persistent im- 
munological memory preserved by the internal image of antigen can be fulfilled. 
The dynamics of the idiotypic network is driven by the influx of new idiotypes 
randomly produced in the bone marrow and by the population dynamics of the 
lymphocytes themself. Modelling this dynamics by simple cellular automata rules 
we give first results describing the architecture of the idiotypic network as the 
highly organized product of a random temporal evolution. 



2. Thl-Th2 Regulation and Venom Immunotherapy of Allergy 

2.1. The Mathematical Models 

2.1.1. Thl/Th2 Regulation Figure 1 shows a (necessarily simplified) scheme of 
Thl/Th2 regulation, cf., e.g., [3,4]. These processes are the base of a mathematical 
model proposed in [5] which we adopt in the following. Since their lifetime is short 
compared to T-cells cytokines can be eliminated in a quasistationary approach. 
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Figure 2. Sensitisation and stimulation of a mast cell. Allergen- 
specific IgE is bound to IgE receptors (FceRI). The resulting aller- 
gen specific units have a density pj. Binding of a (bivalent) aller- 
gen to single units results in complexes of density Crosslinking 
triggers a signal cascade which leads finally to the release of in- 
flammatory mediators. 

Measuring the time in units of the half-life of T-cells and rescaling yields 



N = -N^-a- NA,[1\/{1 T P 2 T 2 ) T c] - (!)N A,,{T2 + c), (1) 

T, = -Ti + vNA,[l\/{\ + p,/A) T c], (2) 

T 2 = -T 2 + v(j)NA,XT2 + c)/[l + piTi/(l + A 2 A)], (3) 

4 = -A,{Tx^-T2)+U^). (4) 



where N, T\j 2 , and are the concentrations of naive cells, Thl, Th2 cells, and 
allergen presented by antigen presenting cells (AFC’s), respectively, a is the pro- 
duction rate of the naive T-cells, v the proliferation rate of the T-cells and 
the rate at which allergen peptide is presented by APC’s after an allergen in- 
jection. The dimensionless parameters and p 2 control the cross-suppression of 
Thl- and Th2-cells mediated by their cytokines and become important at high 
concentrations, whereas (j) regulating the balance of the autocrine effects of the 
Thl Th2 system is relevant at low concentrations. A small cytokine background 
c originating from other immunological processes is avssumed to be constant in 
time. In numerical simulations we use a = 10, c = 10“ ^ (f) = 1.02, = 0.2, 

and p 2 =0.1. As suggested by recent experimental findings [6,7] we chose here a 
parameter setting for (j) and pi .2 fundamentally different from that in [5], for more 
details cf. [8]. 

For allergic individuals the response to a large allergen dose, e.g., injected by 
a bee sting (in our units = 1), is Th2 dominated. Thus, stimulated B-cells will 
produce allergen specific IgE antibodies which sensitise mast cells as described in 
the following. 

2.1.2. Sensitisation and Stimulation of Mast Cells A simplified scheme of pro- 
cesses involved in sensitisation and stimulation of a mast cell is given in Fig. 2, 
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cf. also [9, 10]. The dynamics of the densities of allergen specific units p/, of single 
allergen-IgE-Fc6RI complexes ps, of the serum concentration of allergen A, and 



the intracellular Ca^+ concentration Qz is modelled by 

P'f = -^Pf - PfPs, (5) 

P. = Apf-pfps, ( 6 ) 

A = -0Apf + (7) 

Ca = pfps - Ca, (8) 



where time in units of half-live of and rescaled variables are used. Eqs. (5) 

and (6) describe that the free allergen specific units pj vanish by either forming 
single allergen complexes or by cross-linking to allergen already bound to an aller- 
gen specific unit. The expression of new allergen specific units on the cell surface 
is assumed to take long compared to the desensitisation process and is therefore 
neglected. Eq. (7) describes that allergen A enters the system at a rate and 
is eliminated by binding to surface IgE or to serum antibodies and different types 
of APC’s. Eq. (8) describes that intracellular Ca^'^ increases proportional to the 
number of cross-linked complexes and is removed by sequestration into the mito- 
chondria and by extrusion to the exterior. For the numerical simulations we used 
= 7 = 1, variations in both parameters do not change the qualitative behaviour 
of the system. 

2.2. Venom Immunotherapy 

A widespread and successful therapy of allergy consists in injecting doses of al- 
lergen following an empirically justified schedule of administration [11 13]. The 
therapy has two phases. The initiation phase starting with very low doses which 
are gradually increased may range between few days and several weeks depending 
on the protocol (ultrarush, rush, and conventional, [14]). During the maintenance 
phase large doses are applied every month over a period of several years. We hy- 
pothesize that the first phase is to desensitise the mast cells, whereas the second 
phase leads to a Th2/Thl switch as suggested by the nonlinear dynamics of our 
models. 

2.2.1. Th2/Thl Switch The response of the system to an injection of allergen 
depends on the dose of allergen and on the values of the Thl/Th2 populations 
at the time of injection. (The weak time dependence of the number of naive cells 
can be neglected for the purpose of this discussion.) The injection triggers the 
production of both Thl and Th2 cells. If all the allergen is used up both Thl and 
Th2 populations relax exponentially. For a given setting of the parameters /ii, /i 2 
and (/), and depending on the values of T\ and T 2 at the injection time the ratio 
TilT ‘2 systematically increases or decreases as long as the system is driven by the 
allergen. This behaviour divides the (Ti,T 2 ) plane into regions, where after an 
injection of a given dose of allergen the ratio T 1 /T 2 increases or decreases. These 
regions are separated by what we call an (allergen dose dependent) “dynamical 
separatrix” . Periodic injections of the same dose of allergen drive the system to new 
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Figure 3. Trajectories (black solid line) of the system (l)-(4) 
driven by injections of allergen projected to the (Ti,T 2 ) plane. 
The grey line is the dynamical separatrix for a dose Ap = 1. (a) 
Successful therapy applying the conventional protocol; rush and 
ultrarush protocols lead to qualitatively similar trajectories, (b) 
Failed therapy: the intervals between injections (greater than 2 
months) were too long, (c) Sensitisation by small doses of allergen 
{Ap = 0.1). The dotted line is the separatrix corresponding to this 
dose; the injections are applied in the region where they decrea.se 
T,/T,. 



stable periodic attractors on which T\/T 2 > 1 which is favourable and corresponds 
to the hyposensitised state or T[ /T-? < 1 which is clearly unfavourable. For our 
parameter setting the separatrix is above T 1 /T 2 = 1 so that a system that is 
initially in an allergic region (Ti /T 2 < 1) can be driven by periodic injections to 
the hyposensitised region where T 1 /T 2 > 1, cf. Fig. 3a. 

An improper choice of doses and intervals between injections may drive the 
system in the unfavourable direction: The therapy of an allergic individual fails or 
a healthy individual becomes allergic. Examples are shown in Figs. 3b and c. 

2.2.2. Desensitisation of Mast Cells What is the purpose of the initiation phase of 
the therapy? We argue that it desensitises mast cells and basophils thus enabling 
the injection of larger doses of allergen which provide the Th2/Tlil switch as 
discussed above. A main argument supporting this view is that the same sequence 
of increasing doses of allergen fulfils its purpose independent of whether this period 
extends over several weeks as in conventional protocols or over only few days as in 
the ultrarush protocol. This makes it improbable that the target of the initiation 
phase is the T-cell system which has a characteristic time in the order of weeks 
rather than days. 

In Fig. 4a we show the maximum of the Calcium signal as function of a 
single injection of a given dose. The response characteristic is a log bell-shaped 
curve because the response is driven by the crosslinking of receptors by bivalent 
allergens. If the calcium signal increases a certain threshold a signal cascade is 
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Figure 4. (a) Maximum Calcium signal inside the ma^tcell as 
a function of the allergen dose, (b) Decrease of allergen specific 
receptors pf on the mastcell surface (solid line) during injections 
(doses indicated in the bottom line) at half hour intervals and 
corresponding Calcium response (grey line), the dotted black line 
shows ps. 

activated which finally leads to the release of infiammatory mediators. This is 
prevented if the maximum Calcium signal stays below this threshold. Figure 4b 
shows that a certain subcritical dose diminishes the number of allergen specific 
receptors on the surface. In the next step a larger dose is allowed, the number 
of allergen specific receptors is further diminished and so on until this number is 
so small, that even a large dose of allergen will not cause an intracellular signal 
cascade. Then the mastcells are desensitised and the large doses of the maintenance 
phase can be applied without causing allergic symptoms. 



3. Idiotypic Networks 

3.1. The Bit-String Model 

B-lymphocytes carry on their surface highly specific receptors, so-called antibod- 
ies. If an antigen meets a suitable antibody, an antibody-antigen complex will be 
formed. The binding is based on the complementarity of the shapes of ligands. 
Such complexes can be recognised by “killer cells” or other cells whose function 
is to remove antigen. As conceived by Jerne [15], antibodies can not only detect 
antigen, but also recognise and bind to each other. This leads to a large functional 
network of mutually reacting B-lymphocytes, the idiotypic network. For recent 
experimental data stressing the importance of idiotypic networks see, e.g. [16,17] 
and very recently [18,19]. 

A widely used approach introduced in [20] caricatures the binding regions 
of antibodies by bit-strings i = (ii, ...,id), ii ^ {0, 1} of a certain length d which 
can be estimated as d = 32 . . . 36 [21]. Only antibodies will react whose binding 
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Figure 5. Comparison of a hypercube (left) with the one- 
mismatch base graph (middle) in d = 3 and the one-mismatch 
base graph in d = 4 (right). Solid lines connect perfectly comple- 
mentary vertices, dashed lines those with one mismatch. 



regions (z.e. the bit strings) are complementary to a certain degree. This is specified 
by so-called matching rules. For example, the one-mismatch rule allows binding 
between bit-strings that are complementary with the possible exception of one 
mismatch. Bit-strings of length d can be represented as the vertices of a hypercube 
in dimension d. Vertices are thought to be connected by a link if a given matching 
rule allows binding. These vertices and links constitute the base graph Gd, cf. 
Fig. 5. The set of all vertices represents the potential idiotypic repertoire. At a 
given moment of time, in a real organism, only a selection of all possible idiotypes, 
the actual repertoire, is present. Identifying a present idiotype with an occupied 
vertex, the actual repertoire builds a graph F which is a subset of the base graph 
Gd. To characterise this graph is clearly a task for statistical physics. 

We first investigate the architecture of graphs generated by occupying the 
vertices of the base graph with uniform probability p [22]. This gives the insight 
that an immune system (IS) with randomly occupied idiotypes has the capability to 
work. Apart from theoretical value this could be important for the understanding of 
the IS of new born individuals. In a second step we describe the random evolution 
of the idiotypic network which is driven by the infiux of new idiotypes from the 
bone marrow and by the population dynamics of the lymphocytes. We concentrate 
on a very simple set of rules and extremely few parameters thereby aiming at a 
description of fundamental principles of operation. 

3.2. Architecture and Percolation Transition 

The problem of random occupation of our base graphs differs from an ordinary 
percolation problem since their structure changes as the system size increases. We 
mention only (for more information see [22]) that they have a diameter roughly 
proportional to d/2 and, considered from an arbitrary vertex v, the number of 
vertices with distance d{v) is increasing with d{v) so that most of the vertices of 
the base-graph have nearly the maximal distance d/2 from v. The following results 
refer to the one-mismatch-rule. 
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Figure 6. Distribution of cluster sizes for diflFerent values of the 
occupation probability p for the one-mismatch rule in d = S (left). 
The same at the percolation threshold p = Pperc in d = 10 and 14 
(right). 



Figure 6 (left) shows the cluster size distribution for three typical values of 
p: (i) p = 0.1 < Ppcrc- Clearly, all random clusters are small, there is no giant 
percolating cluster, (ii) p ^ Ppvrc. Increasing p and approaching infinite systems a 
sharp percolation transition was found. At Ppcrc several characteristics of the sys- 
tem obey scaling laws. Fig. 6 (right) illustrates the (finite size) scaling behaviour. 
Series expansion techniques lead to a Fisher exponent r = 3/2. (iii) If the oc- 
cupation probability is further increased, a giant cluster appears which coexists 
with many small clusters. Biologically, this region appears to be the relevant one. 
Small clusters are thought to be necessary for the preservation of immunological 
memory [1,23]. The entity of these small clusters is called peripheral IS. On the 
other hand also a giant cluster is deemed to be necessary, the so-called central IS. 
The function of this part of the IS is still disputed, but could be related to the 
control of autoreactive antibodies [24 26]. 

If still greater values of p are considered, a second threshold Pcxmn comes into 
play. For p > Pconn almost every random graph F will be connected and dense. 
Via some graph theoretical considerations Pconn=l — lim^/^oo "id being 

the coordination number, can be proven. However, in a regime where all idiotypes 
are functionally connected, local changes will percolate through the whole system 
and will not lead to persisting memory. Hence it is clear that only parameters 
P < Pcomi will be of interest. On the other hand, one has to impose a second 
fundamental demand: The IS should be able to detect every conceivable antigen. 
If antigens are modeled in the same way as antibodies the actual repertoire F has 
to be dense. However, being not connected and dense are conflicting demands. 
Reasonable values for p will therefore be slightly below the connectivity threshold 
where F is still almost dense, but not fully connected. 

A next point of interest is the internal structure of the giant cluster C. To 
classify it we used two methods. Firstly, we investigated the “mass distribution” 
of C which is My[s) = 11^' G C,d{v,v') = s}| averaged over all vertices v e C. 
As a second indicator we applied an edge-fragmentation algorithm. Here we cut 
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Figure 7. Two hypothetic situations for the intrinsic structure 
of giant clusters. The circle symbolizes a core of strongly inter- 
connected vertices. The left side shows several loosely connected 
cores, whereas on the right only one core exists. 

every edge of C and determine the mean number of subclusters which result. 
Both methods combined enable to distinguish, e.^., between the different internal 
structures of C shown in Fig. 7. The existence of more than one core would imply 
several extrema in the mass distribution. The numerical results show that this is 
not the case. Also the “no structure at all” hypothesis can be ruled out for values 
of p not too far away from Ppon- . 

Thus, there appears to be a range of occupation probabilities which allows the 
coexistence of small clusters and a giant component, in which the actual repertoire 
is almost dense and the giant cluster has an intrinsic structure consisting of one 
giant strongly connected core and by one link attached subclusters. 

Clearly, the one-mismatch rule is far too idealized to represent the real situa- 
tion. For real networks the occupation probability can be estimated by the ratio of 
actual to potential repertoire as p ^ 1/100. The above properties hold in principle 
also for a two-mismatch rule for which Pp^^-c approaches this order. An idiotypic 
network operating in a proper regime is as we have shown able to meet the 
tradeoff between memory preservation and the need to detect nearly all antigen 
structures. A second result is a strong indication that the giant component (i.e. 
the central part of the IS) has an internal structure. 

3.3. The Randomly Evolving Network 

B cells are stimulated if their receptors are sufficiently costimulated, i.e. cross- 
linked by both chains of the complementary antibody. Crosslinking, however, is 
only possible if the concentration of these antibodies is not too small (encounter 
probability between receptor and antibody too small) or too high (each receptor is 
bound to a different antibody). Hence there is a window of antibody concentration 
that allows stimulation. 

In a simple cellular automaton-like model we relate the presence of antibodies 
of a given idiotype to the number of incident links on the corresponding vertex: 
Only idiotypes v with a number of links l{v) within a window // < l{v) < lu 
will survive. Furthermore, the production of lymphocytes of new idiotype in the 
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Figure 8. Typical evolution of the total population size for a 
randomly evolving network applying the cellular automaton rules 
described in the text. One iteration step roughly corresponds to 
one day. psuirt = 0.2, {k.lu) = (1,8), influx 7 = 5. 

bone marrow is included. Estimates based on experiment hint to a production rate 
which allows to replace the whole network in a couple of days [27]. The algorithm 
consists of the following major steps: 

0 . occupy all vertices randomly with a given probability Pst art 

1 . occupy I empty vertices randomly (mimicking influx from bone marrow) 

2 . remove all vertices v with l{v) outside the stimulation window 

3 . iterate steps 1 and 2. 

Figure 8 shows a time series of the aggregated total number of idiotypes 
generated by applying the above algorithm to the one-mismatch base graph in 
d = 12. The evolution exhibits a typical behaviour. Starting with a random initial 
occupation the population grows strongly during the first couple of time steps until 
it reaches a maximum, then drops to a minimum and finally grows again reaching 
a stationary state. Clearly, during the first few iterations the system acquires new 
vertices and will grow as long as not the whole “space’ is used. However, growth 
and the adjustment of the network structure don’t go hand in hand. Therefore, a 
“growth-dominated-phase” will be followed by an “adjustment-dominated phase” 
during which unfavourably placed vertices will be removed. Having finished this, 
the network can relax into a relatively stable stationary state. This stationary state 
should be further investigated inquiring into characteristics such as clustering and 
degree distributions. 

Interestingly, depending on the choice of the window the evolution leads to 
fundamentally different clustering structures of the network. If the window is quite 
narrow, e.g. (//,/„) = (2,4), the two-clusters increase in number and tend to fill 
the system densely. During the growth-dominated phase of the evolution freshly 
produced idiotypes are recruited into a giant cluster. Later on in the “adjustment- 
phase” this cluster dissociates, the network connectivity declines as observed with 
growing age in real immune systems. The appearance of many two-clusters seems 
very close to Jerne’s broadly accepted idea of “internal-image” memory. Other 
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choices of the window lead to a less stationary behaviour with larger fluctuations. 
In this case the algorithm leads to a giant cluster coexistent with smaller clusters. 
The latter prove to be very short-lived but, nevertheless, take part in a kind of 
dynamical pattern formation. 

These features make this approach very interesting for future work in mod- 
elling idiotypic networks. 
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1. Introduction 

Influenza virus binds to the reeeptors of the host eell plasma membrane via its spike 
glycoprotein hemagglutinin (HA) (Fig.l). After uptake of influenza virus into a 
host cell by the endocytic route, the ectodomain of HA undergoes a conformational 
change at the acidic pH milieu in the late endosonial lumen, and triggers the fusion 
between the viral envelope and the endosomal membrane. The conformational 
change is accompanied by the exposure of a hydrophobic fusion sequence inserting 
eventually into the target membrane [1]. Interaction with the receptor of the target 
cell may be essential for the conformational change. 

To elucidate the intermediates of fusion pore genesis and their structures, in- 
stead of the fusion of the virus itself with the endosomal membrane, fusion between 
HA expressing cells and appropriate target cells, e.g. red blood cells (RBCs), is 
studied. Such a model system even allows to detect single fusion events between 
of two fusing cells (Fig. 2). Those studies revealed that aggregation of the fusion 
proteins to a multimeric complex is required to form a fusion pore [2 4]. 

Mathematical models provide a powerful tool to simulate the fusion process, 
in particular between fusing cells, on a quantitative level and, by that, to under- 
stand common mechanisms of enveloped virus fusion [5 9]. Here, we describe a 
stochastic approach which aims at predicting the statistical distribution of the 
various time-dependent stages in pore formation. The approach explicitly takes 
into account the stochastic nature of the various elementary processes underlying 
the fusion process on the level of individual cell-cell contacts: the size of the con- 
tact area between two fusing cells, the lateral movement of fusion proteins and 
receptor molecules in the respective bilayer, and the interactions between fusion 
proteins and receptors. 
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Figure 1. Entry of influenza virus into host cells 



2. Model 

The initial step of fusion pore formation consists in the binding of an HA-expressing 
cell to the target cell. All small contact sites between the cells, where both cell 
membranes lie approximately parallel and sufficiently close to each other, consti- 
tute the effective contact area which is approximately [10]. Both membrane 

regions involved in a contact site are represented gis continuous two-dimensional 
lattice constituted by small squared membrane units (“unit cells”. Fig. 2) with an 
edge length of 6 nm which corresponds approximately to the spatial extension of 
HA trimers (see [11]). Diffusion of HA trimers and receptors is modeled by random 
transitions between adjacent cells of the simulation lattice. This lateral movement 
of integral membrane proteins is mainly brought about by random collisions with 
membrane lipids. 

A single unit cell cannot be occupied by more than one molecule (HA or 
R) in each of the respective membranes. The effective contact area is dissected 
into smaller simulation areas of a lattice of 50x50 unit cells. Experimental data 
support that this size is in good agreement with that of a single contact site. 
The cumulative distribution function Ftot{t) for any fusion intermediate to occur 
within the time span t in the effective contact area between two fusing cells can 
be related to the corresponding distribution function Fsampie{t) for a simulation 




Stochastic Model of Influenza Virus Fusion 



413 




Figure 2. Fusion of HA-expressing cell with double-labeled red 
blood cells. RBCs were double-labeled with a fluorescent aqueous 
dye (cytoplasm, D-F) and a fluorescent lipid-like dye (membrane, 
G-I). RBC was bound to an HA-expressing cells (A-C phase mi- 
croscopy; D-I fluores(!ence microscopy). At time 0 min (A,D,G), 
fusion was initiated by lowering the pH from 7.4 to 5.0. Both flu- 
orophores are still confined to the RBC. Images taken after 1 min 
39 s (B,E,H), when the lipid fluorophore (H) but not the aqueous 
dye (E) starts moving into the fibroblasts (LP-signal). This signal 
is preceded by a sudden drop of the fluorescence of the lipid dye 
due to the formation of an early ion-permissive pore (IP-signal) 
(not visible, see text). Images taken after 5 min 2 s (C,F,I), when 
aqueous dye has redistributed to the cytoplasm (F), while the lipid 
fluorophore is spreading over the surface of the plasma membrane 
of the HA expressing cell (I). Experimental data in Fig. 5 corre- 
sponds to intensities measured in region 3 (IP-signal) and region 
4 (LP-signal). Reproduced from The Journal of Cell Biology [4] 
by copyright permission of the Rockefeller University Press. (For 
colored picture and more detailed explanation see color plate 15). 
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Figure 3. Definition of the simulation lattice (A) The contact 
area between fusing cells where the two membranes are arranged 
in parallel to each other and in a sufficiently close distance al- 
lowing the formation of HA-receptor contacts. The arrows illus- 
trate how the various macromolecular structures involved in the 
fusion process are mapped onto the simulation lattice. (B) Occu- 
pation state of the simulation lattice derived from the constella- 
tion in panel (A). Generally, a unit cell of the simulation lattice 
can be occupied by one of the shown variables. For details, see 
text. (C) Definition of an ion-permissive pore (IP-signal) and a 
lipid-permissive pore (LP-signal). Modified according to [9]. 

lattice by 

FuAt) = 1 - [1 - F,ann>le{t)f ( 1 ) 

where N refers to the number of simulation lattices required to cover the effective 
contact area. Based on the size of the effective contact areas, the distribution 
function (Eq. 1) has to be calculated from the distribution function of N=12 (12 
0.09 iirn^ ^ 1 //m^) simulation lattices. 

2.1. Model Variables and Processes 

The variables of the fusion model and the elementary processes which may take 
place between them are detailed in the following. HA: Activated hemagglutinin 
trimers: Upon acidification, the initial step of the fusion cascade is the confor- 
mational change of the HA ectodomain. Since the change occurs in a very short 
time-span compared with the rate-limiting processes of fusion pore formation [12], 
we consider all HA trimers to be in the active form. 

R: Receptor. HA binds to a sialic acid containing receptor of the target mem- 
brane. In erythrocytes this receptor is glycophorin. Lateral diffusion of activated 
HA trimers and R is modeled as a random walk from one unit cell into one of the 
four adjacent unit cells (left, right, up, down). 

HA-R: HA-receptor contact is described as reversible process which may take 
place if an HA trimer and a receptor molecule occupy unit cells which lie on top 
of each other. Those contacts are important for refolding of HA toward a fusion- 
competent conformation [13]. 
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HA*: Immobilized HA trimer captured in an HA cluster. HA aggregation 
occurs instantaneously whenever HA trimers occupy adjacent cells. 

C: HA -receptor-membrane link results form a transition of HA-R to a sta- 
ble, non-reversible link between HA and the target membrane described as an 
irreversible first-order reaction. HA-R, HA* and C are considered as completely 
immobilized. 

IP: Ion-permissive early fusion pore is the initial small opening in the contact 
site which makes the inner lumen of the target cell continuous with that of the 
HA-expressing cells. This process leads to a change in the membrane potential 
of the target cell (IP-signal) which can be measured by voltage sensitive dyes [4] 
(Fig. 2). We assume that the ion-permissive fusion pore consists of several HA- 
receptor contacts. Since the minimum number of membrane proteins required to 
form an opening is three, we define the early fusion pore as a cluster of H A-receptor 
contacts arranged in triangle configuration (cf. Fig. 2). 

LP: Lipid-permissive pore: The ion-permissive fusion pore may advance to a 
larger and more stable pore, the so-called lipid-permissive (or lipid mixing) pore 
which can be monitored by lipid dye transfer between membranes (LP-signal) (Fig. 
2). In the model, the transition IP LP requires that at least three HA-receptor 
contacts of the early fusion pore arranged in triangle configuration transit into 
stable HA-receptor-membrane links (C). 

2.2. Master Equation and Simulation Technique 

The chain of events leading to the formation of a fusion pore between individual 
cells defines a Markov process which is governed by a Master equation. Let p^ - 
denote the probability to find a molecule of type X (X = HA, R, HA*, HA-R, C) 
in the unit cell (i,j) at time t (the time argument will be dropped in the following). 
The subscripts i and j (i,j = 1,...,50) label the row and column of the simulation 
lattice. Generally, the time-evolution of the probability pfj is governed by the 
Master equation 



i'.j' X' 

The linear time-evolution operator A{i,j,X i',j',X') in (2) gives the 
probability with which occupation of the unit cell (i,j) by the molecular species 
X is affected by the molecular species X' resident in the unit cell (i’,j’). Since 
the kinetic processes included into the model may take place either in a single 
cell (formation of HA-R or C) or between adjacent cells (lateral diffusion of HA, 
formation of HA* clusters) the summation across the cell indices i’ and j’ in the 
Master equation (2) actually covers only i' = i,i l,i — 1 and j' = j, j + I, j — L 
Numerical solution of the Master equation (2) can be carried out by means of a 
compact and simple simulation algorithm [14]. Generating random initial occupa- 
tions: At time zero, HA trimers and receptor molecules are randomly placed into 
the unit cells of the simulation lattice by generating a random number ze[0, 1] for 
each unit cell of the lattice and putting a respective molecule into the cell if this 






( 2 ) 



dt 
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random number is not larger than the given particle density. Choosing a random 
time step: The probability Prest that the distribution of the model variables across 
the simulation matrix does not change during the time interval decreases ex- 
ponentially, Prest — Atot is the total transition probability obtained by 

adding up the transition probabilities of all possible elementary processes consid- 
ered in the model. Thus, the time required for any transition to occur is a 
stochastic quantity which can be computed by A^ = —{1 / with z being 
a uniformly distributed random number in [0,1]. The absolute time t is increased 
by the time step A^ i.e. t ^ t At. Randomly selecting a distinct transition pro- 
cess: The next step is to select from the whole set of all (Ntot) possible transition 
processes a single transition process to be executed within the time span At. To 
this end, one process out of all possible processes is selected. The probability of 
a process i to be chosen corresponds to its relative transition probability AijAtot 
Updating the occupation numbers of the unit cells: Depending on the single transi- 
tion process chosen, the occupation numbers of the involved unit cells have to be 
updated. If, for example, the selected transition process consists in the formation 
of an HA-R contact in the unit cell (i,j), new occupation numbers are obtained by 
putting Nl'/ - 1, ^ - I, ^ + 1. 

N[j denotes the occupation (= 0 or 1) of the unit cell (i,j) by the molec- 
ular species X. The steps of the simulation procedure are repeatedly executed 
until at least one ion-permissive pore and one lipid-permissive pore (for definitions 
see above) have emerged or, alternatively, until all HA triniers and receptors are 
trapped into isolated HA-receptor-membrane links, which make the formation of 
an ion-permissive pore or a lipid-permissive pore impossible. 



3. Stochastic Simulations of Single-Cell Fusion Kinetics 

The proposed mathematical model was first applied to simulate single-cell fusion 
kinetics as measured by Blumenthal et al. [4]. The following numerical values 
of the kinetic parameters were used in these simulations. The approximate den- 
sity of mobile HA trimers in the plasma membrane of the HA expressing fibrob- 
lasts cell lines [3] and of the sialic acid containing glycophorin A in erythrocytes 
are 750 iim~\ respectively. The rate constants for the diffusion of HA trimers 
(knA — 830s~^) and receptor molecules {kR = 83s“^) were calculated from mea- 
sured diffusion coefficients 3 [3] and 3 [15], respec- 

tively, according to D = Ax^/At^ yielding k = AD/Ax‘^ with Ax = 6nm for the 
rate with which a transition takes place from one unit cell to the neighboring 
unit cell. Numerical values for the remaining three rate constants were chosen 
such that a reasonable concordance between simulated and observed data was 
achieved: (a) association rate = 0.2s“^ for the formation of an HA-R contact 
{HA R ^ HA — R); (b) cissociation rate k- = 0.15s“^ for the decay of an 
HA-R contact {HA - R HA + R); (c) rate kc = 0.15s~^ for the formation of a 
tight HA-receptor-membrane link. The Master equation approach outlined above 
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provides values for the stochastic variables and representing the 

delay times for the first occurrence of an ion-permissive fusion pore and of a lipid- 
permissive pore in the simulation lattice. Repeating the stochastic simulation for 
a sufficiently large number {N,^ampic) of independent simulation lattices one may 
generate sets of stochastic delay times,r}""'^^"'" and ,n = 1, . . . , Nsampie, 

to derive cumulative frequency distributions 



^ sample 



it) 
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ample 
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whereby 0(x) denotes the unit-step function, i.e. 0(x) = (1 if x > 0; 0 else). By 
employing relation Eq. (1), these cumulative frequency distributions for the first 
occurrence of a fusion-pore intermediate (ion-permissive pore or lipid-permissive 
pore) on the simulation lattice can be used to calculate the related cumulated 
frequency distributions FlJI{t) and for the total effective contact area. 

Figure 3 shows an example of the time-dependent stochastic trajectories for 
free re^‘eptors, HA-R contacts and stable HA-receptor-mernbrane links for one 
simulation on a 50x50 lattice. The reversibility in the formation and decay of 
HA-R contacts is reflected by large fluctuations of the respective trajectory. The 
vertical lines in Fig. 3 indicate the delay time t,j, = 134 s for the first occurrence 
of an ion-permissive pore (IP-signal) and tip = 605 s for the first occurrence of a 
lipid-permissive pore (LP-signal) during the simulation. From repeated simulations 
on a 50x50 simulation lattice we constriuded the statistical distributions (Eqs. 
3 and 4) of the characteristic time span needed for the first occurrence of an 
ion-permissive pore and of a lipid-permissive pore in a simulation lattice. Next, 
the respective distributions for the total contact area were computed by relation 
Eq. 1 considering the contact area to be constituted by 12 contact sites (N=12). 
The obtained theoretical distributions of IP-signals (first occurrence of an ion- 
permissive pore) and LP signals (first occurrence of a lipid-permissive pore) are 
both in reasonable concordance with the measurements of Blumenthal et al. [4] 
{Fig. 3). It is seen, however, that the calculated cumulative distribution of IP- 
signals exhibits a slightly steeper ascend than indicated by the measurements. 

In conclusion, we have developed a mathematical model which explicitly takes 
into account the stochastic nature of the molecular events underlying the fusion 
process. As we have recently shown [9], a particular advantage of our approach 
is the possibility to model fusion signals measured in cell suspensions as a super- 
position of stochastic fusion signals arising from single-cell fusion events. To our 
knowledge, this is the first successful attempt to model consistently both types of 
fusion kinetics. This model can also be applied to fusion pore formation of other 
enveloped viruses. Indeed, despite differences, e.g. in the molecular architecture of 
those proteins, and the trigger for the conformational change, essential features of 
the fusion process seems to be conserved among enveloped viruses. Finally, it has 
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FiGURp] 4. Simulation results for a 50x50 simulation lattice. Ex- 
ample of stochastic trajectories for free receptors (R), HA-R con- 
tacts and stable HA- receptor- membrane links (C). The vertical 
lines indicate the delay time tji> = 134 s for the first occurrence 
of an ion-permissive pore (IP-signal) and tj^j^ = 605 s for the 
first occurrence of a lipid-permissive pore (LP-signal) during the 
simulation. Modified according to [9]. 



to be pointed out that the proposed stochastic approach is also applicable to other 
types of membrane-membrane interactions as, for example, cell-cell attachment. 
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1. Biological Rhythms and Dynamical Diseases 

Nearly all functions of the body exhibit nonlinear behaviour, a prominent example 
being the physiological rhythms that are autonomous in nature, e.g. heart beat, 
respiration or peristaltic of the gut. According to the different functions of the 
body rhythms, the time scale of their periodic activity can vary from seconds 
(heart) to hours (sleep-awake cycle) to days or months (e.g. menstruation). More- 
over, they appear at nearly all levels of morphological organisation, ranging from 
the intracellular space to tissue and organs. One important feature of rhythmic 
behaviour is its ability to re^spond in a characteristic manner to different external 
requirements. IncTease in the frequency of the heart beat to increased stress is one 
impressive example. 

In general, the normal state is characterised by synchronisation of endogenous 
rhythms with external rhythms and adaptation of body functions to altered re- 
quirements via changes in periodicity (e.g. jet lag). The state of illness represents 
perturbations of such nonlinear functions which manifest themselves as typical 
dynamical changes, e. g. in the frequency domain. Well known examples are ar- 
rhythmic activity of the heart [1, 2] or disorders of the brain [3, 4]. The arrhythmic 
activity affects both, the normal function of the organ/body as well as its abil- 
ity to adapt to external requirements. In this case, the primary need to restore 
the healthy state is to restore the unperturbed nonlinear dynamics. In the past, 
most therapeutic measures consider the treatment of particular malfunctions only 
on an organic or molecular level, without taking into account possible nonlinear 
dynamic behaviour on levels of higher complexity. However, during the last ten 
years clinical practise has recognised more and more the fundamental meaning of 
rhythrnicity for the therapy of diseases. For example, chemotherapeutic treatment 
of cancer can be improved, when there is a periodic instead of a linear dosage 
of the cytostatica [5]. The cancer cells show a marked difference in the growth 
dynamics, i.e. arrhythmicity of the cell cycle, shift of the cell cycle phases with 
respect to normal growing cells. Accordingly, it is possible to supply the cytostatic 
agents in a periodic fashion, such that there is a maximal anticancer effect and a 
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minimal side effect on healthy cells. Other diseases result from a lack of synchro- 
nisation between endogenous and external rhythms. The sleep/awake rhythm is 
entrained via day/night cycles. Blind people can not recognise this external cycle 
and therefore often suffer from sleeping perturbations. In order to improve the 
sleeping behaviour, the sleep/awake rhythm of blind people is synchronised with 
a new zeitgeber [6]. Instead of light/dark cycles, blind people are treated with al- 
ternative external rhythms, as for example temperature and/or nutrition. At fixed 
time instants, the patients receive a cold shower in the morning and a protein 
rich breakfast. Additionally, stimulation of the circulation by means of ergometer 
training at fixed time instants can improve the therapy. After only 3 weeks of 
treatment with the new zeitgebers, the blind people show nearly normal sleeping 
behaviour. The concept of dynamical diseases has been treated theoretically to 
some extent in references [7] and [8]. 



2. Spatio-Temporal Patterns and Dynamical Diseases 

One important feature of rhythmic reactions is their ability to generate dynam- 
ical as well as stationary patterns in time and space. This knowledge has come 
initially from investigations of chemical or physical systems under far from equi- 
librium conditions [9] and is nowadays also accepted for biological systems [10]. 
The spatio-temporal dynamics introduces a new hierarchic level for the functions 
of rhythmicity, but also for their malfunctions. The initial state of heart infarction, 
the generation of heart fibrillations, document this phenomenon very well. Under 
normal conditions, there are intact waves of electrical excitation that propagate 
over the heart muscle, leading to a co-ordinated excitation-contraction coupling 
which finally leads to heart muscle contraction. In case of heart fibrillation, the 
leading front of such a wave is assumed to break up, leaving open wave ends that 
form rotating spirals of electrical activity [11], well known as re-entrant phenome- 
non. The centres of the spiral waves act as new wave sources that compete with the 
normal pace maker of heart excitation in the sinus node. Under such conditions 
a co-ordinated contraction of the heart muscle is no longer possible. This is an 
impressive example that demonstrates the need to consider the spatio-temporal 
dynamics of rhythmic reactions for the development of successful therapies. 

The spatio-temporal patterns of rhythmic reactions result from the coupling 
of this type of nonlinear (autocatalytic) reactions with transport processes, for 
example diffusion (in fluid systems) or the electrical conductivity (in neuronal or 
heart tissue). In particular, reaction-diffusion coupling is very common also in other 
systems, for instance in chemical or physical ones [12]. Investigations of pattern 
formation in such systems demonstrate the overall validity of this basic principle 
for self-organisation. Traveling waves of activity belong to the most commonly 
observed types of dynamic patterns and their propagation dynamics in different 
systems share many properties. The critical radius for wave formation, the dis- 
persion relation or the trajectory of spiral tips are some characteristic features of 
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the propagation dynamics of such waves [9] . Remarkably, there is no or only little 
difference between waves in biological or chemical systems, with respect to these 
characteristics. In recent years, much effort have been devoted to develop tools for 
controlling the propagation of the waves, for instance by applying electric fields 
or light signals, as amply demonstrated in the chemical Belousov-Zhabotinsky 
reaction [13, 14]. This possibility for external control of reaction-diffusion waves 
opens new aspects for the treatment of dynamical diseases with spatio-temporal 
implications. 



3. Traveling Waves of Spreading Depression 

We now turn to the description of a spatio-temporal phenomenon with possible 
implications for dynamical diseases of spreading depression (SD) [15]. This phe- 
nomenon occurs in all gray matter regions of the central nervous system (CNS) 
and it has even been observed in the spinal cord [16]. SD is named after early 
observations of a propagating transient depression of electric activity. Today, it 
is better characterised by a dramatic failure of ion homeostasis associated with 
the depression. In the cortex it has been often linked to the aura of classic mi- 
graine [17, 18]. Moreover, it has common features with other patho-physiological 
disorders of brain tissue, as for example ischemia [19] or anoxia [20]. In addition, 
an interplay between SD and epilepsy has been reported [21]. From a biomedical 
point of view the intensive research of SD over the last half-century is justified due 
to its pathological implications. Yet, there remains a significant lack of knowledge 
with respect to its basic mechanisms. 

There is little doubt that SD belongs to the class of phenomena that are 
known as reaction-diffusion processes in excitable media, such as heart tissue [22], 
the slime mould Dictyostelium discoideum [23], Xenopus oocytes [24] and NADH- 
waves in glycolysis [25]. In each of these systems one can observe the generic 
features derived from theoretical considerations of standard wave propagation in 
these media, namely the propagation of circular or spiral-shaped waves [26, 27]. 
Since the discovery of retinal SD, many investigations have been performed with 
the isolated diicken retina. It is comparable to the cortex, at least with respect 
to its layer structure. This system is excellently suited for SD research, because 
it combines several experimental advantages. Foremost, an intrinsic optical signal 
(lOS) with high amplitude is associated with SD in the retina. Changes in the 
optical properties of the retina during a SD attack can be seen with the naked eye 
as a milky wave front on a dark background [28] . The transition from transparent 
to milky tissue is likely to be caused by cell swelling, but little is known about the 
mechanism. The chicken retina lacks a vascular system. Consequently, SD waves 
propagate through an essentially homogeneous system (at least on the macroscopic 
scale, which we are interested in). The complete avascularity of the avian retina ap- 
pears to be associated with the presence of a supplemental nutritional device, the 
pecten. Covering a part (about 1 mm'^) of the retina from the site of the exit of the 
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Figure 1 . Schematic drawing of the experimental setup to record 
waves of spreading depression 
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optic nerve and projecting into the vitreous humor, the area of the pecten is not 
accessible for recording wave propagation. Spreading depression can be elicited 
by mechanical, electrical and chemical stimuli in all grey matter regions of the 
CNS. In the chicken retina, the susceptibility to SD is comparably high. Gently 
touching the retina with a sharp glass needle (diameter 50 /im) initiates an out- 
ward traveling SD wave. Tetanic electrical stimulation can also provoke SD, but 
it may trigger epileptiform seizures, too. The retina can be considered in good 
approximation as a naturally two-dimensional slice of nervous tissue. The possible 
topologies of wave patterns are restricted to rotating spirals and target patterns 
(concentric circles) known from a well- investigated chemical model system, the 
Belousov- Zhabot insky reaction [29]. Video image microscopy of excitation waves 
in the retina have been described in some detail in reference [30]. As shown in Fig. 
1, a stereo microscope equipped with a charge-coupled device (CCD) video camera 
and a control unit records the wave dynamics via a time lapse video recorder. The 
video signal is digitised into a frame of 512 x 512 pixels using 1 out of 256 (8-bit) 
intensity values (grey levels) for each pixel. The resulting movie is then further 
processed by appropriate image processing techniques [31]. With this apparatus 




Figure 2. Spreading Depression waves in the chicken retina. A. 

An outward expanding circular shaped wave. B. A rotating spiral 
shaped wave of spreading depression, induced by break up of a 
circular wave front. Bar = 1mm. 

circular (Fig. 2A) and spiral-shaped (Fig. 2B) wave fronts of spreading depression 
can be readily observed and evaluated in terms of wave profiles, front velocity, spi- 
ral rotation frequency and trajectories of the spiral tip. One of the characteristic 
properties of wave trains in excitable media is the dispersion of wave velocities, 
which in the normal case means an increase of velocity with increasing wave length 
(Fig. 3). An extrapolation of this curve to minimum- velocity yields the minimal 
wavelength between two waves. This minimal wavelength represents the absolute 
refractory zone in the back of an excitation wave, where no new excitation can 
occur. It is characterised by the regeneration of the tissue. In the retina, the abso- 
lute refractory period is about 2.5 minutes [32] corresponding to a distance of 10 
mm, the wave velocity being 4 mm/min. The propagation dynamics of excitation 
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period / [min] 



Figure 3. Dispersion relation of spreading depression waves The 
normalized velocity is plotted as a function of the refractory pe- 
riod. The value obtained from a spiral-shaped SD wave (circular 
mark) is added to the graph. 



waves, as for example the trajectory of spiral tips, are strongly affected by the 
size of the refractory zone. For spiral waves it is important to analyse the proper- 
ties of the spiral tip and its dynamical evolution. The tip trajectory is found by 
tracking the tip locations of a rotating spiral step by step and connecting them 
to a line (for a definition of the tip location see [33]). Common examples that one 
finds in excitable media are rigid rotation (circular trajectory) and meandering 
(hypocycloidal trajectory). A rotating spiral of retinal spreading depression has 
a more complex trajectory. Figure 4 shows an example obtained from a 10 min 
recording of a spiral SD wave in the chicken retina. We note from this trajectory 
that the motion of the spiral tip in this particular neuronal medium consists of 
a sequence of Z-type rotations, where four spiral rotations share almost the same 
pivot. This observation points to some characteristic properties of the investigated 
system which have to be elucidated by further investigations of cellular processes 
that occur during retinal SD. 
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Figure 4. Pattern of a spiral tip trajectory from the SD wave in 
chicken retina composed of four spiral rotations. Arrows indicate 
the direction of movement. 

4. Implications for the Aura of Classical Migraine 

Individuals suffering from classical migraine report an astonishing diversity of mi- 
graine auras. A frequently reported symptom is a visual hallucination known as 
fortification illusion or zig-zag pattern [34, 35], see also Fig. 5. The characteris- 
tic form and development of the fortification illusion suggests that the underlying 
phenomenon is a wave propagating through the primary visual cortex, possibly the 
cortical spreading depression (CSD) [17, 36]. In fact, the first attempt to connect 
migraine with a cerebral propagation phenomenon was made by Lashley (35) 3 
years before SD was discovered. While suffering from attacks of classical migraine, 
Lashley calculated, from knowledge of the retino-cortical projection, the speed of 
the travelling scotoma to be about 3 mm/min in the visual cortex. This result fits 
perfectly with the measured velocity of a SD wave, as noted by Milner (37). The 
demonstration of unique changes of cerebral blood flow during attacks of migraine 
with aura have been replicated in animal experiments during CSD and thus con- 
stitute another important line of support for the spreading depression theory, as 
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Figure 5. Sketch of the fortification illusion drawn by Karl Lash- 
ley [35]. The kidney shaped visual disturbance exhibits a typical 
zig-zag pattern at its convex side, while the inner part corresponds 
to a scotoma (blind area). Usually the disturbance starts close to 
the fovea and increases in size as it accelerates to the periphery 
(from Lashley 1941; copyright 1941, American Medical Associa- 
tion). 
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reviewed by Lauritzen [17]. We propose that the perceived migraine hallucinations 
are generated by a planar excitation wave (CSD-like event) propagating across the 
primary visual cortex (VI), and that the peculiar zig-zag front of the fortification 
illusion reflects the cortical organisation of orientation preference. 

In general, the response properties of neurons in the VI to visual information 
are exceedingly well organised, albeit complex. First, VI is retinotopically organ- 
ised such that stimulating neighbouring points in the visual field leads to activation 
of neighbouring neurons in the cortex. Another major functional property of vi- 
sual cortical neurons is their orientation selectivity, i.e. the fact that the cells fire 
maximally when a contour of a particular orientation appears within their recep- 
tive field. As demonstrated by single-unit recording [38] and, more recently, with 
optical imaging techniques [39, 40] neurons with similar preferred orientations 
are clustered together and not distributed randomly across the cortical surface. 
The layout of orientation preference in the visual cortex is characterised by iso- 
orientation domains that are arranged around centres in a pinwheel-like faishion 
[40, 41]. 

Loewel et al. have constructed an orientation preference map of the area 
17 in VI from cats using optical imaging of intrinsic signals [42]. Based on this 
orientation preference map, we have calculated the average orientation vectors v 
for several fragments of this map [43] . These vectors were represented by a bar that 
was rotated such that its orientation was parallel to the direction of the vector. The 
magnitude of the vector is a measure of dominance among the fragments. In this 
way, a total of 869 vectors v were calculated, each having its particular orientation 
and magnitude, representing its dominance. The orientations were plotted as bars, 
each at its spatial location. Since the bars now possibly intersect, they were plotted 
in the order of their dominance on top of each other as white-black-white steps, 
representing the basic structure of the receptive field of simple cells in VI. The 
resulting pattern is illustrated in Fig. 6. It displays features of the fortification 
illusion: distinct domains of roughly parallel lines are present, each domain having 
a specific angle with respect to the adjacent domain. Our simulations led us predict, 
that the zig-zag pattern perceived by individuals suffering from classical migraine 
reflects the cortical organisation of their visual cortical orientation map. 

The pattern in Fig. 6 is created by using the orientation information of a 
certain area and transforming them back to a representation by bars of the calcu- 
lated average orientation. An important question is how a CSD wave can possibly 
produce a signal accessible to perception. Cortical Spreading Depression is charac- 
terised by a transient high amplitude signal in all major ions, that develops into an 
intercellular wave co-ordinating the cellular responses within the grey matter. At 
the plateau of the CSD attack, the affected neurons are in the state which makes 
their participation in signal processing unlikely. Therefore this phgise presumably 
corresponds to the scotoma in the back of the zig-zag front, and the zig-zag front 
itself corresponds to the onset of CSD. We suggest that the fortification illusion can 
be reproduced by assuming that a continuous smooth CSD-like excitation front 
propagates across the primary visual cortex. 
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Figure 6. Simulation of the fortification illusion. Multiple bars 
with white margins were plotted in the order of their dominance. 
The pattern resembles a snapshot of a fraction of the moving zig- 
zag front. 
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5. Perspectives 

The concept of dynamical diseases is based on the temporal and spatio-temporal 
dynamics of biological processes, either at the cellular or the organ level or both. 
Since the dynamics often implies oscillations in space and time, the fundamental 
tools for the analysis of nonlinear processes in physics and chemistry can and have 
been applied for a better understanding of dynamical diseases. Due to this concept, 
the knowledge of the system properties is a major requirement for the control of 
its dynamics and thereby the treatment of diseases. More and more examples from 
the clinical practise demonstrate the validity of this concept. 

The simulation of the fortification illusion during a migraine attack is solely 
based on the idea of a traveling reaction-diffusion wave across the primary vi- 
sual cortex, i. e. a nonlinear phenomenon, and the functional behaviour of the 
underlying tissue. The results demonstrate the powerful possibilities of nonlinear 
dynamics. From a biomedical point of view it is of primary interest to control the 
propagation dynamics of the waves and/or to suppress their generation in order 
to prevent the migraine attacks. Such an external control should be possible by 
means of the well established methods from nonlinear dynamics, e. g. feed back 
control [44]. However, some knowledge about the basic mechanisms of spreading 
depression is required in order to apply such methods. The nature of the activating 
and of the inhibiting processes as well as the feed back loops for the oscillatory re- 
action must be known. Therefore, deeper insights into the biochemical mechanisms 
of SD are necessary that determine the function and regulation of this complex 
phenomenon on a cellular level. 

A further interesting point for future analysis of the visual hallucinations 
is to characterise the dynamical properties of the kaleidoscopic change, as it was 
called by Lashley [17], while the fortification illusion is propagating through the 
visual field. According to our explanation of the origin of the fortification illu- 
sion, some bars should change their orientation clockwise for a certain period of 
time while others should rotate counter clockwise. To visualise this dynamics of our 
simulated fortification illusion, we created a MPEG-movie (http://www.migraene- 
aura.de/simulation/dynamic_fi.html) by calculating single images of the aura as 
seen in Fig. 6. If such transitions could be seen by migraine sufferers this would be 
a further indication for our model of the classical migraine aura. Having this infor- 
mation at hand, it should be possible to search for improved methods to suppress 
the appearance of the fortification illusion by means of established methods for 
the controlled movement of reaction-diffusion waves. Hopefully, such treatments 
would also prevent migraine attacks. 
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Figure 3 from page 107. 

Periodic intercellular calcium waves in an array of heterogeneous cells. (A) Intrinsic frequencies of the 
uncoupled cells. The frequency varies according to the activated agonist receptor in each cell, drawn from a 
normal distribution (R = 0.6 ± 0.05). Cell 1 is assigned a higher value {R = 0.8). (B) Frequency histogram. (C) 
Space time plot of the cytosolic calcium concentration if the cells are uncoupled = Pw ~ 0-^ ywm/s); (D) if the 
cells are coupled (P^ = 0.6 //m/s, P^= 1.5 //m/s). = 22.5 //m^/s, v„ = 0.05 //M/s, the other parameters as in 

Table 1 (p. 103). 
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Figure 1 from page 154. 

Cartoons of the filaments and motor structures. 
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Figure 3 from page 163. 

Highly simplified two-dimensional energy landscape for a motor protein with a chemical and a spatial axis 
(arrows). An external load would tilt this landscape in the spatial direction, a chemical non-equilibrium in the 
chemical direction. Mechanochemical coupling is reflected in the diagonal path (red line); a particle (red sphere) 
would advance along the spatial axis even with a purely „chemical tilt“. 
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Figure 4 from page 167. 

Schemes of the hand-over-hand mechanism (upper) and its translation in terms of a two-state model (lower). 
The different phases of the coordinated activity of the motor domains are reduced to the motion of a stochastic 
particle at position x and in chemical states 1 or 2 (see text). The phase A corresponds to the transition driven by 
the ATP hydrolysis (hydrolysis and energy release), while the phase B to the sliding of the particle over the 
potential profile (conformational change). 
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Figure 1 from page 228. 

Model of the enzyme as a protein machine. 




Figure 1 from page 243. 

Map showing the problem of the Bridges of Konigsberg [20] which gave birth to the foundation of graph theory 
as a mathematical discipline (a). The same problem represented in terms of a graph G and its associated spectral 
density for y= 0.08 (b). 
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Figure 2 from page 272. 

Equilibrium sorting patterns in mixtures of embryonic chicken tissues. The surface tension of each tissue is 
indicated on the right in units of dyne/cm. 




Figure 3 from page 273. 

Sorting in genetically transformed cell populations with controlled number of cell adhesion molecules. See text 
for details. The linear size of the aggregate is around 200 //m. 
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Figure 2 from page 281 . 

Pseudo- ID-triangulation with model cell. Open circles denote binding sites, the rectangular box represents a 
model cell. In this configuration there are no fluxes in the y-direction. 




Figure 3 from page 282. 

Force-stress relation. A: The force on a single binding site depends linearly on the stress in the underlying 
matrix. B: Two neighbour cells pull stronger on the side where they face each other (black arrows: forces on 
each single bindig side, arrows below the cell: net force for the left and right side, color of the matrix: V,^). 
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Figure 4 from page 283. 

Spontaneous cell motion in ID. A: Plot of the matrix density ip-pQ) for the cell given below. To visualise the 
stochastic effect of the distribution of binding sites the cell is blocked in movement. As a result the forces are 
imbalanced and it pulls harder on the right site which causes an asymmetric density profile (Vertical lines in the 
density plot indicate the cell boundaries for the given cell below, arrows denote the net forces for the left and 
right side, coulor: matrix density). B: Inclusion of cell motion counter-balances the asymmetry. 




Figure 6 from page 285. 

Tensiotaxis mediated aggregation in 2D. A: Attraction of two cells in 2D. Cells now are represented by circles 
with a lamella region and traction forces (black arrows, only shown for the first time step) point to the cell 
centre. The colour of the matrix codes the stress. Maximum tension can be found in between cells and like in ID 
cells move towards each other. B: Same situation as in A, but now with six cells. To exclude effects caused by 
cells coming to close to the domain boundary, a larger multi resolution grid is used. Black arrows here denote 
the matrix velocity. 
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Figure 3 from page 315. 

Expression dynamics of HyTcf ^ HyWnt, and HyBral during aggregate development. In situ hybridization 
reveals patterning events during head organizer formation. HyWnt and HyBral appear simultaneously in small 
spots (24h) which enlarge during later stages (96h), and precede formation of morphological head structures by 
about 2-3 days. All spots eventually develop into heads. (From Proceedings of the National Academy of 
Sciences USA 97: 12127-12131 and Nature 407: 186-189; Nature Publishing Group, 2000; PNAS 2000). 
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Figure 1 from page 325. 

(A) the freshwater polyp hydra. (B) With the 
inner endodermal layer and the outer ecto- 
dermal layer, the basic body plan is close to a 
gastrula. (C) A drawing from Haeckel’s paper 
[3], proposing that all higher organisms 
proceed through a similar-appearing gastru- 
lalike stage. (D-F) The oral-aboral axis of 
hydra and the proposed correspondence the 
anteroposterior axis in higher organisms. The 
foot of the hydra is proposed to be derived 
from the bottom of the cup-shaped ancient 
gastrula. This part of the ancestor gave rise to 
the most anterior part of verebrates, the for- 
brain and heart (Nkx2.x expression, pink). In 
contrast, the opening of the cup gave rise to 
the mouth opening in hydra and to the anus 
of higher organisms (^A7^expression). The 
region around the tentacles corresponds to 
the midbrain-hindbrain border (border of 
Otx-Gsc expression, (blue/yellow). In verte- 
brates (F), Gsc, (yellow) participates in head 
formation while Brachyury^ (red) forms the 
notochord and the tail bud. In hydra both 
genes are expressed next to each other. Thus, 
this is the zone that gave rise to the trunk (for 
details, see [6], for actual expression patterns 
in hydra [13-17]). 
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Figure 3 from page 330. 

Generation of complex patterns by linkage of 
several pattern forming reactions: Simulation 
of hypostome, tentacle and foot formation in 
hydra. Primary head activation (green) and 
foot activation (pink) apppear at opposite 
ends of the field due to a coupling via the 
source density (or competence, blue). Ten- 
tacle activation (red) appears close to the 
hypostome since it requires a high source 
density but it is locally suppressed by head 
activation. 

Bottom right: In regeneration near-head frag- 
ments tentacle activation precedes head acti- 
vation. The new tentacle signal appears first 
at the tip, the region of the highest source 
density. Later, with the rising head signal, it 
is shifted to the appropriate position, in 
agreement with the experimental observa- 
tions [13]. 

Bottom left: In more basal fragments head 
activation occurs first. Tentacle activation 
takes place later at the final position after the 
source density has obtained a threshold value 
(from [11, 18]). 
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Figure 5 from page 342. 

Larval skull and cranial musculature of Bomhina oricntalis, depicted in dorsal (left) and ventral views. Neural 
crest-derived cartilages are shaded according to the migratory stream from which they originate (redrawn form 
Olsson and Hanken, 1996): light brown, mandibular stream; medium brown, hyoid stream; dark brown, bran- 
chial stream. The few non-crest-derived cartilages are light grey. Cranial muscles arc depicted schematically. 
Mandibular (first) arch muscles are red, hyoid (second) arch muscles are blue. Paired muscles are depicted on 
one side only. Cartilages: BB, basibranchial; BH basihyal; CB, ceratobranchials I-IV; CH, ceratohyal; CT, 
cornua trabecula (trabecular horn); IR, infrarostral; MC Meckel’s; OC, otic capsule; PQ, palatoquadrate; SR, 
suprarostral; TP, trabecular plate. Muscles: lev, levator mandibulae group-mlma, levator mandibulae anterior; 
mlmaa, levator mandibulae anterior articularis; mlmas, levator mandibulae anterior subexternus; mlmp, levator 
mandibulae posterior (comprising two parts; superficialis and profundus); ang, angularis group-mha, hyoan- 
gularis; mqa, quadratoangularis; msa, suspensorioangularis; hyoideus group-mih, interhyoideus; moh, orbito- 
hyoideus; msh, suspensoriohyoideus; osh, orbito- and suspensoriohyoideus; others-mia, intermandibularis 
anterior; mip, intermandibularis posterior; mml, mandibulolabioalis. Anatomical nomenclature follows 
Cannatella [2]. 
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Figure 6 from 343. 

A larva of the Mexican axolotl stained with antibodies specifically recognising muscle (green) and cranial 
nerves (red). The eyes are autofluorescent in both green and red and therefore appear yellow. Following 
immunostaining, the larva was bleached and made transparent, and subsequently imaged using a confocal 
microscope. This allows optical sectioning through the larval head. The optical sections were then put together 
and all parts of the larva are in focus at the same time. The red dots (there are many of them ventrally) are cilia, 
not nerves. Both cranial nerves and cilia contain the molecule (acetylated alpha-tubulin) recognised by the 
antibody. Muscles were stained using a desmin antibody. 




Figure 1 from page 385. 

Homogeneous tumour results, spatial distri- 
bution of tumour cells, MDE, MM and 
oxygen (clockwise) at time t = 200 units (i.e. 
200 generations, approximately 133 days). 
Colouration of the tumour cells represents 
phenotype: orange (type I); yellow (type II); 
green (type III); blue (type IV) and brown 
represents dead cells. For the MDE, MM and 
oxygen concentration, the hot colour map is 
used, i.e. white = high concentration, black = 
low concentration and red is in between. 
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Figure 2 from page 386. 

Heterogeneous tumour results, spatial distribution of tumour eells, MDE, MM, and oxygen (cloekwise) at time 
t = 200 units. Coloration as in Figure 2. 
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Figure 3 from page 387. 

Random tumour results, spatial distribution of tumour cells, MDE, MM and oxygen (clockwise) at time t = 200 
units. Coloration as in Figure 2. 
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Figure 2 of page 413. 

Fusion of HA-expressing cell with douhle-laheled red blood cell. RBCs were double-labeled with a green 
fluorescent aqueous dye (cytoplasm) and a red fluorescent lipid-like dye (membrane). RBC was bound to an 
HA-expressing cells (A-C phase microscopy; D-I fluorescence microscopy). At time 0 min (A,D,G), fusion was 
initiated by lowering the pH from 7.4 to 5.0. Both fluorophores are still confined to the RBC. Images taken after 
1 min 39 s (B,E,H), when the lipid fluorophore (H) but not the aqueous dye (E) starts moving into the 
fibroblasts (LP-signal). This signal is preceded by a sudden drop of the fluorescence of the lipid dye due to the 
formation of an early ion-permissive pore (IP-signal) (not visible, see text). Images taken after 5 min 2 s (C,F,I), 
when aqueous dye has redistributed to the cytoplasm (F), while the lipid fluorophore is spreading over the 
surface of the plasma membrane of the HA expressing cell (I). Experimental data in Figure 5 (p. 419) 
corresponds to intensities measured in region 3 (IP-signal) and region 4 (LP-signal). Reproduced from The 
Journal of Cell Biology [4] by copyright permission of the Rockefeller University Press. 



